Preface




Epidemiology is widely recognized as the basic science of preven- 
tive medicine. This recognition and the greatly increased public 
support of medical research have found expression in this coun- 
try in a striking increase in the number of research studies and 
programs of an epidemiologic type. Naturally, this increase has 
been most marked for those diseases over which greatest concern 
is now being felt — arteriosclerotic heart disease, cancer and the 
mental disorders. While there has been repeated expression of 
faith in the applicability of the principles of epidemiologic investi- 
gation to noninfectious diseases, there is no doubt that the trans- 
ference of epidemiologic concern from the infectious to the 
noninfectious diseases has necessitated appreciable modification of 
methods, if not of principles. We hope, therefore, that this attempt 
to summarize concepts that seem to us particularly appropriate to 
the present time will be of service. 

Concern with the present by no means implies disregard for in- 
fectious disease. Outstanding problems remain unsolved in this 
field. In addition, our frequent use in this volume of illustrations 
from studies of infectious disease is testimony to the fact that the 
present status of epidemiology as a discipline must be credited in 
major part to its achievements in this area. 

In spite of the increasing use of biostatistical techniques in epi- 
demiologic studies, we have avoided, so far as possible, the duplica- 
tion of material that is readily found in textbooks of biostatistics. 
We believe that the understanding of the principles discussed here 
does not require statistical knowledge. However, the reader in- 
tending to put some of these principles into practice would do 
well to sentence himself to a term with a book of elementary 
statistics, the sentence preferably to run concurrently with the 
present one. 
Whoever wishes to investigate medicine properly should proceed
thus: in the first place to consider the seasons of the year, and what 
effects each of them produces. Then the winds, the hot and the 
cold, especially such as are common to all countries, and then such 
as are peculiar to each locality. In the same manner, when one 
comes into a city to which he is a stranger, he should consider its
situation, how it lies as to the winds and the rising of the sun; for 
its influence is not the same whether it lies to the north or the 
south, to the rising or to the setting sun. One should consider most 
attentively the waters which the inhabitants use, whether they be 
marshy and soft, or hard and running from elevated and rocky 
situations, and then if saltish and unfit for cooking; and the 
ground, whether it be naked and deficient in water, or wooded 
and well watered, and whether it lies in a hollow, confined situ- 
ation, or is elevated and cold; and the mode in which the in- 
habitants live, and what are their pursuits, whether they are fond 
of drinking and eating to excess, and given to indolence, or are 
fond of exercise and labor.
On Airs, Waters and Places
Epidemiologic Methods


1 
The Place of Epidemiology in the Investigation of Disease


DEFINITION 


Epidemiology is the study of the distribution and determinants 
of disease prevalence in man. 

Two main areas are indicated in the definition. These are the
study of the distribution of disease (descriptive epidemiology) and 
the search for the determinants of the noted distribution (ana- 
lytic epidemiology). The first, describing the distribution of health 
status in terms of age, sex, race, geography, etc., might be
considered an extension of the discipline of demography to health 
and disease. The second, involving interpretation of the distribu- 
tion in terms of possible causal factors, is the special contribution 
of epidemiology. Definitions of epidemiology that consider only 
the descriptive phase do not indicate the unique component of 
the discipline. 

Epidemiology, like most other sciences, has developed from the 
study of the exotic and the unusual into an inquiry as to general 
principles, including explanation of the unusual. Epidemiology is 
now no more restricted to the study of striking outbreaks of dis- 
ease than meteorology is restricted to the study of hurricanes or 
astronomy to eclipses of the sun. Yet a modern epidemiologist 
might still consider his vocation to be primarily a “medical 
science treating of epidemics” (Webster) in view of two recent 
trends in the practice of the discipline. These are an expansion 
of the concept of what constitutes an epidemic, and a realization
that study designed to interpret epidemic patterns cannot be re- 
stricted to periods of epidemic prevalence. 


The Concept of an Epidemic 


The term “epidemic” has until quite recently been almost en- 
tirely restricted to the description of acute outbreaks of infec- 
tious diseases. However, an acceptable definition of an epidemic 
might be “the occurrence in a community . . . of a group of ill- 
nesses of similar nature, clearly in excess of normal expectation” 
(American Public Health Association, 1960). This definition en- 
compasses phenomena ranging from the classical pestilences to 
episodes in the behavior of many noninfectious diseases. The 
United States, for example, is at the present time in the grip of 
epidemics of at least two diseases — coronary heart disease and 
lung cancer — which satisfy as well as any infectious disease the 
criterion of being clearly in excess of normal expectation. Lung 
cancer is many times more common in this country than it was 
twenty years ago, and, although it is now the most common form 
of cancer in American males, there are many countries where it 
remains rare. 


Epidemic and Nonepidemic Prevalence 


A second trend has been growing realization of the value of 
knowledge of disease frequency and distribution during both epi- 
demic and nonepidemic times, even when the predominant con- 
cern may be with the explanation of epidemics. 

In the first place, without knowledge of nonepidemic preva- 
lence, how can we know that an epidemic exists? How can we 
determine whether the frequency of a disease present in a partic- 
ular population at a particular time is in excess of normal ex- 
pectancy? Clearly, in order to demonstrate that a particular 
frequency of morbidity or mortality is in excess of the normal ex- 
pectancy, it is necessary to know the frequency of the same con- 
dition in other populations and in the same population at other 
times. 

In certain instances the existence of an epidemic is perfectly 
obvious. This is so when the epidemic involves a large proportion 
of persons, produces evidence of distinctive illness, and occurs 
over a short period of time. Thus there is little difficulty in detecting
epidemics of cholera, plague, smallpox or the common
infections of childhood. In all these cases the disease is familiar,
the difference between epidemic and nonepidemic prevalence is
large, and the transition is rapid. In contrast, the current risk to
an American male of dying of coronary heart disease is quite as 
large as the risk of death experienced during some of the major 
epidemics of infectious disease, but the general population re- 
mains almost unaware of the existence of an epidemic of coronary 
heart disease. The slow growth of the epidemic of this disease has 
served to conceal its enormity from the population. 

Even acute epidemics may pass unnoticed if they appear in unfamiliar
form. Thus, during the intense London fog of 1952 there
was very limited realization of the effects of the fog on the popula- 
tion’s health. The full effects were only appreciated when deaths 
for the period were counted and compared with deaths during 
the preceding and subsequent periods of the same year and dur- 
ing similar periods of previous years (Figure 1). It then became 
apparent that the fog was responsible for some 4000 deaths. 
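
The comparison just described, deaths in the period of interest set against deaths in comparable periods, can be sketched as a small calculation. This is only an illustration under stated assumptions: the function name `excess_deaths` is hypothetical, the counts are invented and are not the London figures, and "normal expectation" is assumed to be the simple mean of the comparison periods.

```python
from statistics import mean

def excess_deaths(observed, comparison_counts):
    """Return (excess, expected) for one period of interest.

    Assumption: the normal expectation is the plain mean of the deaths
    counted in comparable periods (adjacent periods of the same year,
    or the same period in previous years).
    """
    expected = mean(comparison_counts)
    return observed - expected, expected

# Invented counts, for illustration only.
comparison_periods = [100, 110, 90, 100]
excess, expected = excess_deaths(180, comparison_periods)
print(f"expected {expected}, observed 180, excess {excess}")
```

A more careful expectation would allow for season and long-term trend; the plain mean is used here only to keep the idea visible.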

Second, finding of an unusually low frequency in a particu- 
lar population at a particular time may be just as significant in 
understanding the causes of epidemics as the epidemics them- 
selves. For example, the very low attack rates from cholera ob- 
served by John Snow (1855) among two groups of people in the 
center of an otherwise epidemic area led to a strengthening of the 
belief that the water supply was responsible for the epidemic. 
Similarly, knowledge of the virtual absence of cancer of the
uterine cervix in nuns is of value in the formulation of hypotheses
regarding the etiology of this disease.

Finally, in the chronic diseases, which have prolonged up- 
swings and downswings of the epidemic wave, it may be difficult 
to say whether or not a given frequency qualifies as “epidemic” 
even if all the necessary comparative information is available. It 
is common to find a gradient in the relative frequencies of a dis- 
ease in a number of populations. While the disease may be con- 
sidered definitely epidemic in those populations at the top of the 
gradient by comparison with those at the bottom, the status of 
those in the center may be equivocal. Under such circumstances 
attempts to correlate quantitative statements of disease frequency 
with quantitative statements of the frequency of the suspected
factors are more precise than attempts to correlate the crude
dichotomy of epidemicity and nonepidemicity with the crude
dichotomy of presence or absence of specific factors. The former
procedure has been followed, for example, in relating characteristics
of the diet to rates of coronary heart disease in different
populations.

Figure 1. Daily numbers of deaths and degree of atmospheric pollution, London, November 29
to December 16, 1952. Atmospheric pollution is measured as parts per million of sulphur dioxide.
(Data from the Ministry of Health, 1954.)

These considerations have led to a great many disease fre- 
quency studies which, while having as an ultimate purpose the 
explanation of unusual prevalences of disease, are not oriented 
directly toward the study of situations known to be epidemic. 


PURPOSES 


Knowledge of the distribution of disease may be utilized in one 
of three general areas — the elucidation of causal mechanisms, 
the explanation of local characteristics of disease occurrence, and 
administrative guidance in the provision of health services. 


Understanding the Causation of Disease 


The most challenging purpose of epidemiology is that of iden- 
tifying those components of causal mechanisms that enable the 
formulation of effective preventive measures. This aim encom- 
passes a number of subsidiary, more specific purposes: 

(1) Formulation, selection, or rejection of hypotheses that 
explain disease distribution in terms of specific characteristics or 
experiences of affected persons. 

(2) Testing hypotheses of disease origin through special 
surveys or other observational studies. 

(3) Testing the validity of the rationale on which control 
programs are based, by the use of epidemiologic data collected 
in conjunction with the programs. A control program may be 
considered as an experimental design, the subject of the experi- 
ment being the knowledge of causal mechanisms which gave rise 
to the program. 

(4) Distinguishing “entities” of disease and disability 
through knowledge of disease distribution and of causal factors. 
On occasion, similarity of epidemiologic behavior may point to 
etiologic similarity between clinically distinct entities. Conversely,
difference in epidemiologic distribution for subgroups
of an apparent clinical entity may suggest that such subgroups 
should be regarded as separate disease entities because of proba- 
ble etiologic differences. 

Most frequently, epidemiologists are concerned with these aims 
as applied to the elucidation of the causation of disease. How- 
ever, processes of human biology other than those with an end 
product of manifest disease have been studied. These have in- 
cluded prenatal and postnatal growth, multiple pregnancy, sex de- 
termination, intelligence and fertility. 

It has been suggested that epidemiology should be concerned 
as much with those positive components of health implicit in 
the definition of the word as used by the World Health Organ- 
ization* as with those negative characteristics that are recognized 
as frank disease. However, the number of widespread and serious 
diseases having defined clinical and pathologic characteristics but 
an unknown etiology appears adequate to occupy the world’s 
epidemiologists for some years to come. Concentration of effort 
on these latter problems is indicated both because of the great 
need for knowledge leading to the prevention of these diseases 
and because of the practical difficulties in the statistical or epi- 
demiologic treatment of concepts that have been neither clini- 
cally nor pathologically defined. 


Understanding Local Disease Patterns 


The preceding purpose dealt with the acquisition of new 
knowledge about the origin and nature of a disease. In contra- 
distinction, the practicing epidemiologist is often concerned with 
specific local epidemics or other variations in disease frequency. 
Here he utilizes what is already known about the etiology of, say, 
typhoid fever, to explain and deal with an outbreak and to for- 
mulate preventive measures suitable to the particular community. 

Of course a fine distinction between these two approaches can- 
not be drawn, since new knowledge may be derived during 
the course of routine investigation. Nevertheless, in a great deal 
of “practical” epidemiology, new knowledge of disease causation
is not purposefully sought and its advent would be unexpected.
Typical of this practical epidemiology are the large numbers
of investigations of localized outbreaks of food poisoning and
salmonellosis undertaken by departments of health with a view
to finding the local source of infection, halting the epidemic, and
preventing its recurrence. At the present time, because of inadequacies
of knowledge concerning other illnesses, such practical
uses of epidemiology are largely confined to infectious diseases
and certain industrial intoxications.

*“Health is a state of complete physical, mental and social well-being and not
merely the absence of disease or infirmity” (W.H.O., 1948).


Administrative Purposes 


Knowledge of the frequency of disease in a population serves a 
number of administrative purposes. For example, it is essential to 
the logical planning of facilities for medical care. Estimation of 
the number of hospital beds required for patients with specific 
diseases (tuberculosis, cancer) or for given segments of the popu- 
lation (infants, the aged) requires knowledge of the frequency 
and natural history of either those specific diseases or all diseases 
in those specific parts of the population. In addition, the plan- 
ning of a variety of studies related to health — clinical, thera- 
peutic, and preventive — requires, for efficiency of design, knowl- 
edge of how many cases of a particular disease are likely to be 
found in a given population during a given period. 

Knowledge of the relative frequencies of a disease among sub- 
groups of the population is also useful, since it enables programs 
and studies to be directed toward the population group manifest- 
ing the greatest concentration of the disease. For example, if fa- 
cilities are limited, it assists in deciding which age, occupational, 
sex, geographic or ethnic group should be the target for a tuber- 
culosis, diabetes or cancer screening program. Similar considera- 
tions apply to the choice of a subgroup of the population for any 
study, whatever its purpose, that requires the maximum yield of 
cases for a given size of population studied. 


RELATIONSHIP TO OTHER DISCIPLINES 


Two characteristics of epidemiology allow us to place this dis- 
cipline in a logical relationship to the many other disciplines con- 
cerned with understanding the causes of disease. These are that 
epidemiology is primarily an applied discipline, and that its 
methods are predominantly observational, rather than experimental.

Its classification as an applied discipline — that is, one con- 
cerned with the solution of immediate practical problems — im- 
plies that there are certain prerequisite disciplines, disciplines 
whose content is used by the epidemiologist as the starting point 
for his own observations. Needless to say, the content of practi- 
cally any discipline may be relevant to the epidemiologic study 
of specific problems, but there are three disciplines that are par- 
ticularly pertinent. The contributions of two, clinical medicine 
and pathology, to the accurate description and classification of 
disease form the basis for determination of disease frequency. 
The more clearly defined and delineated the disease or biologic 
process in question, the greater is the probability of successful
epidemiologic observation. The third discipline, biostatistics, is 
necessary for determining whether observed variations in disease 
frequency within and between groups of the population are more 
than fortuitous. 

The fact that the contributions of other disciplines are required 
in addition to those of epidemiology is implicit in the statement 
that the methods of epidemiology are predominantly observational. 
Since the most convincing test for causal relationship is usually 
through experiment (see Chapter 2), the methods of other disci- 
plines are needed for more critical examination of suspected causal 
relationships and for the investigation of their mechanisms. Epi- 
demiology does have a role in the experimental investigation of 
disease, but it is a limited one. 

The disciplines whose purview it is to undertake these experi- 
mental investigations vary with the subject area within which the 
suspected relationship falls. The most pertinent at the present 
time is microbiology. However, a great deal of experimental work 
in biochemistry, radiobiology, and experimental pathology is 
aimed at the explanation of associations noted through epidemi- 
ologic observation. Since observation is in general a great deal 
more economical than experiment, one of the purposes of epi- 
demiology may be considered as the identification and narrow- 
ing down of subject areas within which practical benefit is most 
likely to result from experimental investigation.
DEFINITION


All descriptive sciences take note that certain events tend to be 
associated with each other in time. Some of these associations be- 
tween events have characteristics which lead us to speak of the 
relationship as cause and effect — the first event being denoted 
the “cause” of the second. The use of these words sometimes 
leads to the idea that the second event is the inevitable conse- 
quence of the first. This concept is now discredited by most 
schools of philosophy. The first doubts are usually ascribed to 
Hume (1739), who wrote: “We are never able, in a single in- 
stance, to discover any power or necessary connection, any quality 
which binds the effect to the cause, and renders the one an infal- 
lible consequence of the other. We only find that one does actu- 
ally, in fact, follow the other.” 

There is nothing inevitable about causal associations, and there 
is no logical reason to believe that because they have been demon- 
strated under conditions existing in the past, they will necessarily 
continue in the future. Of course, the longer they have been ob- 
served and the greater their constancy in the past, the stronger 
will be our belief in the likelihood of their continuation in the 
future, but such belief rests on empirical observations rather than 
on any logical process of deduction. 

Doubt as to the demonstrability of the basic nature of causal 
relationships extends even to the observations of the physical
sciences in which association between two events may be so strong 
as to seem inevitable within the limits of our experience. 

What then are the special features of certain relationships
which lead us to describe them as causal? We should note first 
that, the term being abstract, its meaning will not be universal. 
Any definition of it must take into account the purpose for which 
it is intended, and no definition will have equal validity in all 
branches of science. Epidemiology has a practical purpose in the 
discovery of relationships which offer possibilities of disease pre- 
vention. For this purpose, therefore, a causal association may be 
defined as an association between two categories of events in 
which a change in the frequency or quality of one is observed to 
follow alteration in the other. In certain instances the possibility 
of alteration must be presumed and a presumptive classification 
of an association as causal may be justified. Human genetics is a 
science based in its entirety on associations that can only pre- 
sumptively be classified as causal. 


TYPES OF ASSOCIATION 


To understand the implications of the use of the term “causal 
association,” it is necessary to describe some of the ways in which 
events may be related. With respect to each other, two categories 
of events* may be: 

(A) Independent (not statistically associated)
(B) Statistically associated
    (1) Noncausally (secondarily) associated
    (2) Causally associated
        (a) Indirectly causal
        (b) Directly causal

From the above, the investigation of a relationship can be seen 
to progress from demonstration of statistical association to demon- 
stration that the association is causal, and ultimately to ascertain- 
ment of its directness. 


* By a “category of events” is meant all the single events having the specific
characteristics necessary to warrant their classification together; for example, “multi- 
ple sclerosis” as distinct from the single event “a case of multiple sclerosis.” 
Statistical Association


If a category of events has a frequency x and another category
has a frequency y, then, if the two are unrelated, the two types of
events will occur together with a predictable frequency, equal in
fact to the product of their individual frequencies, xy. Suppose
one of these categories to be a
disease and the other to be an attribute or experience of man: 
there will be a certain number of individuals who exhibit both 
the experience and the disease. While the single events may be 
said to be associated (in the nontechnical sense) in such individ- 
uals, these instances of coexistence are not evidence of statistical 
association. Statistical association, or merely “association” in the 
scientific sense, implies that the proportion of individuals exhibit- 
ing both events is either significantly higher or significantly 
lower than the proportion predicted on the basis of the separate 
frequencies of the two categories. If the two categories of events 
are each measured on quantitative scales rather than in terms of 
frequency, a correlation significantly different from zero is ac- 
cepted as evidence of association. 
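
The comparison of observed with predicted coexistence can be made concrete with a short sketch. The function names and the figures below are hypothetical, chosen only for illustration. Note that the ratio computed here measures the departure from the product xy; it says nothing by itself about whether that departure is statistically significant.

```python
def expected_joint_proportion(x, y):
    """Proportion expected to show both events if the categories are unrelated."""
    return x * y

def observed_to_expected(observed_joint, x, y):
    """Ratio of observed to expected coexistence; 1.0 indicates no association."""
    return observed_joint / expected_joint_proportion(x, y)

# Hypothetical population: 20 per cent have the attribute, 5 per cent the disease.
x, y = 0.20, 0.05
expected = expected_joint_proportion(x, y)   # about 1 per cent have both by chance
ratio = observed_to_expected(0.03, x, y)     # suppose 3 per cent are observed to have both
print(f"expected {expected:.3f}, observed 0.030, ratio {ratio:.1f}")
```

A ratio well above or below 1 suggests association, subject to a test of significance on the underlying counts.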

While the content of the last paragraph is obvious enough, its 
main implication is not widely accepted. Since the demonstration 
of statistical association requires comparison of observed with 
predicted coexistence of categories of events, and since statisti- 
cal association is a necessary characteristic of cause, it is apparent 
that cause can only be determined in statistical terms. In other 
words, it is never possible to determine whether two single events 
are causally related; such determination can only be made with 
respect to categories of events.

For example, suppose that 100 persons are inoculated with a 
vaccine against a certain infectious disease and that 100 persons 
undistinguishable from the first group receive a placebo. During 
a subsequent epidemic, although the two groups have similar ex- 
posure, 20 of the vaccinated persons and 50 of the unvaccinated 
contract the disease. Since this difference is unlikely to be due to 
chance, we would say that a statistical association exists between 
vaccination and remaining free of the disease and, in view of 
criteria discussed below (page 15), that this association is proba- 
bly a causal one. However, it is not possible to say that vaccination 
caused any individual person in the vaccinated series to remain
disease-free, since there were instances of vaccinated persons who 
contracted the disease and of unvaccinated who remained free. 
Within the framework of these results, it is even possible that 
there were persons who contracted the disease because of vaccination,
even though the over-all tendency was in the opposite direction.
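
The statement that the difference between 20 and 50 cases per 100 is unlikely to be due to chance can be checked with a short calculation. The text does not name a test; the sketch below assumes a Pearson chi-square test on the fourfold table, without continuity correction, and the function names are hypothetical.

```python
import math

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (a * d - b * c) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator

def p_value_1df(chi2):
    """Upper-tail probability for a chi-square variate with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2))

# Rows: vaccinated, unvaccinated. Columns: diseased, disease-free.
chi2 = chi_square_2x2(20, 80, 50, 50)
p = p_value_1df(chi2)
print(f"chi-square = {chi2:.2f}, p = {p:.2g}")
```

The test speaks only to the two groups as wholes, which is the point of the passage: it gives no verdict on any single vaccinated person.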

Of course, information from a group experience may suggest 
the probability of the existence of causal association in an in- 
dividual instance. The stronger the association between the two 
categories of events revealed by the group experience, the more 
likely is the assumption of causal association in a specific instance 
to be correct. Thus, if the disease frequency in the unvaccinated
series had been 99 per cent and that in the vaccinated series 1
per cent, there would be a very high probability that the absence
of disease in any one vaccinated individual was related to the vac- 
cination. The statement that the vaccination of any one individ- 
ual was causally related to his freedom from disease would, in 
such circumstances, be likely to be correct. The validity of the 
statement depends, however, on the total experience, and not on 
any observations made on this one individual (other than that he 
was one of those who were vaccinated and remained free of dis- 
ease). 
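
One way to attach a number to this reasoning is sketched below. It rests on two assumptions that are not part of the text: that the two series are fully comparable, and that vaccination never produces a case that would not otherwise have occurred (the text itself notes that this second assumption cannot be verified from the group results). Under those assumptions, the share of disease-free vaccinated persons who owe their freedom from disease to vaccination is the difference in disease frequency divided by the disease-free proportion of the vaccinated series. The function name is hypothetical.

```python
def protected_share_of_healthy_vaccinated(risk_unvaccinated, risk_vaccinated):
    """Share of disease-free vaccinated persons protected by the vaccine.

    Assumptions: comparable groups, and a vaccine that harms no one.
    """
    if not 0 <= risk_vaccinated < 1:
        raise ValueError("risk_vaccinated must be at least 0 and less than 1")
    return (risk_unvaccinated - risk_vaccinated) / (1 - risk_vaccinated)

# The hypothetical strong association: 99 per cent against 1 per cent.
strong = protected_share_of_healthy_vaccinated(0.99, 0.01)
# The earlier example: 50 per cent against 20 per cent.
weaker = protected_share_of_healthy_vaccinated(0.50, 0.20)
print(f"strong association: {strong:.3f}, weaker association: {weaker:.3f}")
```

With the stronger association nearly every disease-free vaccinated person can be presumed protected; with the weaker one, fewer than half can, which is why a causal statement about one individual is much less safe there.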


Causal and Secondary Association 


Only a minority of statistical associations are causal within the 
sense of the definition, which requires that change in one party 
to the association alters the other. The large number of statistical
associations which do not satisfy this requirement may, for
want of a better term, be described as secondary. Secondary associations
usually result from association of both categories of events
with a third category. For example, if category C is causally associated
with both category A and category B (i.e., C precedes
and influences both A and B), A and B will necessarily be associated
statistically. However, the association between A and B is
noncausal, since there is no prospect of altering B by manipulating 
A. This type of association is common. For example, injection of 
neoarsphenamine (A) in outpatient clinics for venereal disease has 
been noted to be associated with jaundice, “salvarsan icterus”
(B). For a long time, as the name indicates, the drug was regarded
as the cause of the icterus, until it was discovered that the
association was the result of causal association of both icterus and 
injection of neoarsphenamine with a third factor — treatment 
for syphilis (C). 
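
How a third category C can produce a secondary association between A and B may be seen in a small numerical sketch. All probabilities below are invented for illustration, and the function name is hypothetical. A and B are constructed so that, within each level of C, they occur together exactly as often as chance predicts; the association appears only when the two levels of C are pooled.

```python
def marginals_and_joint(p_c, p_a_if_c, p_a_if_not_c, p_b_if_c, p_b_if_not_c):
    """Return P(A), P(B), P(A and B) when A and B are unrelated within each level of C."""
    strata = [
        (p_c, p_a_if_c, p_b_if_c),              # C present
        (1 - p_c, p_a_if_not_c, p_b_if_not_c),  # C absent
    ]
    p_a = sum(weight * a for weight, a, b in strata)
    p_b = sum(weight * b for weight, a, b in strata)
    # Within a stratum the joint frequency is the simple product a * b.
    p_ab = sum(weight * a * b for weight, a, b in strata)
    return p_a, p_b, p_ab

# Invented figures: C is present in half the population and raises
# the frequency of both A and B.
p_a, p_b, p_ab = marginals_and_joint(0.5, 0.8, 0.1, 0.3, 0.02)
ratio = p_ab / (p_a * p_b)
print(f"P(A)={p_a:.2f}  P(B)={p_b:.2f}  P(A and B)={p_ab:.3f}  observed/expected={ratio:.2f}")
```

In the pooled population A and B coexist more often than the product of their frequencies predicts, although altering A within either level of C would leave B unchanged.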

Once a statistical association has been demonstrated, how can 
it be determined whether or not it is causal — if A is altered, is B 
affected? The most satisfactory procedure is by direct experiment. 
However, this procedure, while freely available to the laboratory 
worker, is infrequently available to the epidemiologist. Quite 
apart from the general limitations on human experimentation, 
experiments in disease prevention are difficult to perform, fre- 
quently involve considerable disturbance of personal and social 
behavior, and are expensive. They are unlikely to be undertaken 
unless the probability of a favorable outcome is very high. In 
certain associations, the presumed cause may not be susceptible 
to manipulation and therefore to experiment; while knowledge 
of whether the relationship is or is not a causal one may in this 
event be of little immediate value, it may nevertheless be impor- 
tant from the standpoint of the development of knowledge. 

In the absence of experimental proof, three types of considera- 
tion are useful in distinguishing between causal and secondary 
associations: 

(1) Time sequence. Associations in which the related events 
occur simultaneously or with varying precedence cannot be con- 
sidered causal. In a causal relationship it is necessary that the 
sequence of individual events be consistent. In situations in which 
the sequence of events cannot be determined precisely (a not uncommon
situation in chronic disease), at least the possibility of
consistent sequence must exist for a relationship to be considered 
causal. 

(2) Strength of the association. The stronger the association
between two categories of events (for example, the higher the 
ratio of the incidence of B following A to the incidence of B 
without A), the more likely the association is to be causal. 

(3) Consistency with existing knowledge. Clinical, path- 
ologic and other knowledge of the disease in question and knowl- 
edge of other pathologic or physiologic effects of the supposed 
causal factor, leading to a judgment as to whether or not a causal 
hypothesis is “reasonable,” play an important part in the acceptance
of such a hypothesis. On the other hand, collateral evidence
may suggest a satisfactory explanation of the association other 
than as the result of a causal relationship. 
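
The ratio mentioned under (2), the incidence of B following A divided by the incidence of B without A, can be computed as in the sketch below. The function name is hypothetical. The figures are those of the vaccination example given earlier, with A taken as remaining unvaccinated and B as contracting the disease.

```python
def incidence_ratio(cases_with_a, total_with_a, cases_without_a, total_without_a):
    """Incidence of B among those with A divided by incidence of B among those without A."""
    incidence_with = cases_with_a / total_with_a
    incidence_without = cases_without_a / total_without_a
    if incidence_without == 0:
        raise ValueError("ratio is undefined when no cases occur without A")
    return incidence_with / incidence_without

# 50 of 100 unvaccinated and 20 of 100 vaccinated persons contracted the disease.
observed_ratio = incidence_ratio(50, 100, 20, 100)
# The hypothetical stronger association: 99 per cent against 1 per cent.
strong_ratio = incidence_ratio(99, 100, 1, 100)
print(f"observed ratio {observed_ratio:.1f}, hypothetical strong ratio {strong_ratio:.0f}")
```

A large ratio makes a causal explanation more likely, but, as the other two considerations indicate, it does not establish one.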

The assessment of a relationship as causal in the absence of 
direct experiment is neither easy nor objective. Differences of 
opinion resulting from subjective interpretation and assembly 
of evidence are common. Caution in presuming relationships to be 
causal is laudable. On occasion, however, such caution appears to 
be carried to an unrealistic extreme, the implications of sound 
evidence pointing to the causal nature of an established relation- 
ship being stubbornly resisted. In situations in which the deriva- 
tion of experimental proof is either impracticable or unethical, 
complete evidence will never be at hand, and it may be impru- 
dent to await it. 


Direct and Indirect Causal Association 


At the beginning of the previous section (page 14), we dis- 
cussed a statistical association between two categories of events 
that was explicable in terms of association of both with a third 
category. This type of secondary association must be distin- 
guished from causal associations in which a third variable oc- 
cupies an intermediate stage. Thus, if A is causally related to D 
(A being the cause and D the effect) and D is causally related to 
B (D the cause, B the effect), there will be a causal relationship 
between A and B, although the association is said to be indirect. 
For example, treatment for syphilis is not of itself productive of 
icterus, but it is one of the factors associated with use of unclean 
syringes. Further investigation of “salvarsan icterus” indicated
that the unclean syringe component rather than the treatment of 
the syphilis was responsible for the icterus. However, since a cer- 
tain number of cases of icterus would presumably be prevented 
by failure to treat syphilis, the association of syphilis treatment 
with icterus is causal, although indirectly so. 
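
The chain of the example may be sketched in a few lines of Python. This is an illustration only, under our own assumptions: the table of links merely restates the three events named above, and the names CAUSAL_LINKS, causal_path and describe are hypothetical, chosen for this sketch.

```python
# Hypothetical causal links restating the example in the text.
# Each key is a cause; each value lists its immediate effects.
CAUSAL_LINKS = {
    "treatment for syphilis": ["use of unclean syringes"],
    "use of unclean syringes": ["icterus"],
}


def causal_path(links, cause, effect, seen=None):
    """Return one chain of events leading from cause to effect, or None."""
    if seen is None:
        seen = set()
    if cause == effect:
        return [cause]
    seen.add(cause)
    for nxt in links.get(cause, []):
        if nxt not in seen:
            rest = causal_path(links, nxt, effect, seen)
            if rest is not None:
                return [cause] + rest
    return None


def describe(links, cause, effect):
    """Label the association as direct, indirect, or absent in this table."""
    path = causal_path(links, cause, effect)
    if path is None or len(path) < 2:
        return "no causal association"
    # Two events in the path means no intermediate stage is recorded.
    return "direct" if len(path) == 2 else "indirect"


print(describe(CAUSAL_LINKS, "treatment for syphilis", "icterus"))
print(describe(CAUSAL_LINKS, "use of unclean syringes", "icterus"))
```

With these links the first association is reported as indirect and the second as direct. The label depends entirely on which intermediate events have been entered in the table, so adding a newly recognized intermediate stage turns a "direct" association into an "indirect" one.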

The distinction between direct and indirect causal relation- 
ships is a relative one. Actually, all causal relationships are in- 
direct, apparent directness depending on the limitations of cur- 
rent knowledge. In the example above, the association of icterus 
with syphilis treatment was indirect, and that with the use of 
unclean syringes direct. However, further investigation revealed
that the icterus was associated not with unclean syringes per se, 
but with the injection of small remnants of human serum associated
with the use of unclean syringes. This discovery resulted in
a change of the name of the condition to “serum hepatitis.” However,
the icterus was further found to be associated directly, not
with serum, but with the presence in the serum, on occasion, of 
hepatitis virus. Thus the association with the virus is currently 
the direct one and that with serum an indirect one. Further 
studies might in due course reveal what specific attributes or 
molecular components of the virus might be considered more 
direct causal factors. Knowledge of causal mechanisms will never 
be refined to the degree that makes it possible to state that this is 
the ultimate direct association and that no other, more direct asso- 
ciations intervene. 

A goal of complete knowledge would require study of causal 
associations until the most direct causal mechanisms that can be 
observed are identified (i.e., ad infinitum). However, the practi- 
cal significance of causal associations in the development of pre- 
ventive programs does not necessarily depend on the degree of 
directness. First, more direct associations may not yet have been 
identified and there may therefore be no choice but to make use 
of obviously indirect associations. For example, knowledge of the 
association of freedom from scurvy with diets containing fresh 
fruits and vegetables was put to practical use hundreds of years 
before the isolation of vitamin C, and prevention of smallpox 
certainly antedates modern virology by several centuries. A similar
situation exists with respect to the association of cigarette
smoking with lung cancer at the present time. Second, intermedi- 
ate direct causes may not be susceptible of economical alteration, 
even if known, whereas the indirect ones are. For example, pre- 
ventive measures against serum hepatitis are directed against 
poor syringe hygiene and not specifically toward removal of the 
hepatitis virus. Furthermore, even seventy years after the dis- 
covery of the microorganisms associated with enteric disease, 
preventive measures are directed, at least in the United States 
and Europe, primarily toward the provision of clean water and 
food.


THE WEB OF CAUSATION


We have discussed the types of association that may exist be- 
tween two categories of events. In fact, effects are never depend- 
ent on single causes. The concept of “chains of causation” although 
common has the defect of oversimplification. In Figure 2 are 
shown some of the components that enter into the causal associa- 
tion between treatment for syphilis and serum hepatitis. When 
it is considered that only a few of the major components are 
shown, that these are indicated as broad classes of events rather 
than as the multiple minor events comprising each class, that each 
component shown is itself the result of a complex genealogy of 
antecedents, and that the myriad of effects of these components 
other than those contributing to the development of icterus are 
not shown, then the “chains” of causation become difficult to elu- 
cidate and the whole genealogy might be thought of more appro- 
priately as a web, which in its complexity and origins lies quite 
beyond our understanding. 

Fortunately, to effect preventive measures, it is not necessary 
to understand causal mechanisms in their entirety. Even knowl- 
edge of one small component may allow some degree of pre- 
vention. 

The concept of the “chain” mechanism is that many variables
may be related to a single effect through a “direct-indirect” mechanism
in which C is causally related to D, D to E, E to F, and so on
until finally maybe Q plays an important part in the development 
of the disease. We may note that many variables other than those 
directly in the chain enter the genealogy at each step of the 
chain. Consequently, the longer the chain the weaker the associa- 
tion. A preventive attack on the disease depends on finding an 
element in the chain which can be eliminated and which is sufficiently
close to Q in the mechanism for its elimination to have a
substantial effect on Q. Thus the injection of hepatitis virus in 
venereal disease clinics can be prevented at a number of different 
points in the chain, and wherever the chain is broken the disease 
will be prevented, even though no attempt has been made to 
alter, for example, the immunity status of the patients injected. 
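
The weakening of association with length of chain can be illustrated by simple arithmetic. The sketch below assumes, purely for illustration, that each link transmits its effect with a fixed probability independent of the other links. The probabilities and the name chain_strength are hypothetical and are not estimates for any disease.

```python
import math


def chain_strength(link_probabilities):
    """Probability that the first event of a chain leads to the last one,
    under the illustrative assumption that the links act independently."""
    return math.prod(link_probabilities)


# Hypothetical chains in which every link has probability 0.5.
short_chain = [0.5, 0.5]
long_chain = [0.5, 0.5, 0.5, 0.5]

# Breaking any single link (probability 0) prevents the final effect.
broken_chain = [0.5, 0.0, 0.5, 0.5]

print(chain_strength(short_chain))   # 0.25
print(chain_strength(long_chain))    # 0.0625
print(chain_strength(broken_chain))  # 0.0
```

Under these assumptions the four-link chain carries a quarter of the strength of the two-link chain, and the chain broken at any one point carries none, whatever the values of the remaining links.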

The effect that the elimination of Q will have on the total in- 
cidence of the disease will depend on the importance of Q relative 
to other factors that, descended from independent sources, also
are causally related to the disease. The effect of these independent
variables may be thought of as additive: the elimination of
any one of them may have an appreciable effect on the disease 
frequency, but nevertheless the effect of the other factors remains. 
For example, the elimination of serum hepatitis associated with 
treatment for syphilis will have only very indirect effects on the 
disease arising from other sources such as blood transfusion. 
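
The additive view may be put in numerical form. The figures below are invented solely for illustration (cases of serum hepatitis per 100,000 persons per year from three hypothetical independent sources), and the names used are our own.

```python
# Hypothetical incidence (cases per 100,000 per year) by independent source.
incidence_by_source = {
    "treatment for syphilis": 30,
    "blood transfusion": 50,
    "other sources": 20,
}


def incidence_after_eliminating(sources, eliminated):
    """Total incidence remaining once one source has been removed,
    assuming the contributions of the sources simply add."""
    return sum(rate for name, rate in sources.items() if name != eliminated)


total = sum(incidence_by_source.values())
remaining = incidence_after_eliminating(
    incidence_by_source, "treatment for syphilis"
)
print(total, remaining)  # 100 70
```

In this invented case the elimination of one source removes its own 30 cases and leaves the 70 cases arising from the other sources untouched.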

Ideally, one would search for components of the causal mech- 
anism that are necessary causes — causes without which the dis- 
ease does not occur — even though such causes may not be of 
themselves sufficient to produce the disease. Thus, Mycobac- 
terium tuberculosis may be thought of as the necessary cause of 
tuberculosis, and other factors such as age, nutritional status, sex, 
genetic constitution, and so on, as additive “contributory” factors.
However, this distinction between causes and contributory factors 
is arbitrary and often quite meaningless. Most commonly, the 
chosen factor is the necessary cause only by definition, in the sense 
that automobiles are the necessary cause of automobile accidents. 
Thus M. tuberculosis may be the necessary cause of tuberculosis 
as it is presently defined, but had medical knowledge developed 
differently, this pathologic state might have been included in the 
definition of a specific nutritional defect. In that case the nutri- 
tional defect would have been the necessary cause and the bacillus 
a contributory factor (while there is a definite statistical associa- 
tion between the tubercle bacillus and tuberculosis, the bacillus 
is frequently found in individuals who are not ill, and sometimes 
is not found in individuals with an illness clinically indistinguish- 
able from tuberculosis). 

The method of quantitative estimation of the disease incidence 
attributable to identified causal factors, as described in Chapter 
13, seems preferable to semantic exercises aimed at a hierarchic 
classification of causes. Once again we may note that the practi- 
cal implications of a causal association may be influenced by 
considerations other than quantitative estimates of the “impor- 
tance” of the association. Thus the role of microbiologic agents 
in infectious disease has been considered predominant over that 
of genetic factors, perhaps in part because their importance be- 
came evident earlier but primarily because it is a good deal easier 
to control human water supply and to eradicate insect vectors
than to breed genetically resistant populations. 


CAUSATION AND PREVENTION 


The etiology of a disease may be thought of in two parts: (1) 
those events occurring prior to some initial bodily change (incep- 
tion of the disease), and (2) those mechanisms operative within 
the body which lead from the initial change to the characteristic 
manifestation of the disease. Thus, one can conceive of a “basic”
association or group of associations of high strength and high 
specificity which involves the effect of a specific environment on 
a specific type of individual. For example, it may be that penetra- 
tion of a specifically predisposed marrow cell by ionizing rays 
will lead to genetic mutation in a very high proportion of cases, 
and, if circumstances are appropriate for the development of the 
cell, to the patient’s becoming leukemic. The reaction of the 
marrow cell to the radioactivity may be considered the direct 
causal association (although it is no more direct than our meth- 
ods of examination are capable of detecting), and we may de- 
scribe: (1) the series of prior events which brought the radio- 
active particles in contact with a susceptible marrow cell, and 
(2) the series of events initiated by the mutation which finally 
resulted in the death of the patient. 

This is, of course, a gross oversimplification of the develop- 
ment of a disease. To take a historical analogy, we speak of an 
“epoch” as an event which resulted in a significant change in the 
course of a country, for example a war or a revolution. Actually, 
such events consist of an infinite number of different events in- 
teracting with each other but each having its own distinct ante- 
cedents. Interference with any one of these individual events may 
critically influence the course of the whole epoch. Nevertheless, 
viewed from a temporal or geographic distance, an epoch becomes 
thought of as a single event either because of the blurring associated
with the long-range view or because of the arbitrary selection
of some point as the “turning point” of the epoch. We can
then speak of the periods preceding and succeeding the epoch, as, 
for example, the prewar era and the postwar era, and describe the 
events leading up to the war and the results following it. 

Prevention (and epidemiology) are concerned predominantly
with those successions of events which result in the exposure of 
specific types of individuals to specific types of environments lead- 
ing to or aggravating bodily changes in those individuals (pre- 
inception era). Therapy is concerned with the mechanism initi- 
ated in the basic bodily changes which finally results in the 
manifest signs and symptoms of the disease (postinception era). 
This distinction may lead to differences between practitioners of 
prevention and of therapy in their concepts of what constitutes an 
important causal association. For example, the therapist may be 
satisfied to know that deficiency of pancreatic secretion is an 
important cause of diabetes mellitus, since if this deficiency is 
remedied the symptoms of the disease are significantly modified. 
Such knowledge is inadequate for prevention, however, for which 
purpose more pertinent questions would be what caused the pan- 
creatic deficiency. 

If knowledge of causal associations is sought predominantly 
for its practical application, there is one other characteristic of 
such associations that it is desirable to determine: the side effects 
of alteration of the cause. For, just as any effect has multiple 
causes, the alteration of any cause may be expected to have mul- 
tiple effects besides the one intended, and in any prevention 
program the side effects of alteration of the cause must be accep- 
table. On the one hand, a certain loss of character in the taste of 
public water supplies is apparently an acceptable price to pay for 
a typhoid-free existence, but on the other, the citizens of many 
communities have expressed their view that a predictable re- 
duction in the incidence of dental decay is not worth the conjec- 
tured risks that fluoridation of water supplies might entail. 

Knowledge of causal associations that do not offer preventive 
possibilities, either because the cause is unalterable or because the 
side effects are unacceptable, is nevertheless important. Knowl- 
edge of such associations is necessary for general understanding of 
the nature of disease, and, in addition, allows more accurate studies 
directed toward the identification of causal associations that do 
offer preventive possibilities. Thus, association with age can hardly
be thought of as an association with preventive application, but 
if epidemiologic studies of chronic disease did not take this association
into account they would be meaningless.


Ways of Classifying Ill or
Disabled Persons 


Classification of disease is sometimes a prerequisite and some- 
times a goal of epidemiologic study. To understand the methods 
of epidemiology it is necessary to have some concepts of the 
methods and purposes of the classificatory process. 

Classifications are arrangements of things into groups so that 
the members of one group can be distinguished from those of 
another. The things arranged may be tangible objects (e.g., 
books, jewels, or persons) or abstract concepts (e.g., countries of 
birth, religious experiences, or even classifications) (Hempel, 
1959). 

The purpose of classification is simple: while it is not possible 
to form generalizations from an individual thing, consideration of 
similar things in a group does permit generalizations. Study of 
things in a group may be productive of generalizations that are 
(1) explanatory of the origin of the group, (2) predictive of 
changes in the group with the passage of time subsequent to its 
delineation, or (3) both explanatory and predictive. 

Classifications themselves are traditionally designated as arti- 
ficial or natural. Linnaeus’s system of dividing plants into twenty- 
four classes according to the stamens is often cited as an example 
of an artificial system, while those classifications which fit in 
with current evidence on plant evolution are said to be natural. 
The distinguishing of classifications into these two types does
not seem particularly realistic. The level of existing knowledge 
largely determines what criteria are used to arrange things into 
groups. In a time not yet influenced by Darwinian theory it seems
not at all unnatural that differences between plants in the char- 
acteristics of the stamens were considered important. Further, 
the particular interest of the observer determines the kind of 
criteria selected to identify groups; for example, a landscape 
gardener may find it useful for his purposes to classify bushes as 
large and small. 


Groupings of Ill or Disabled Persons 


The arrangement of ill or disabled persons into groups so that 
the members of one group may be distinguished from the mem- 
bers of another (with respect to specified criteria connected with 
the illness of each individual) is essential to the concept of “separate”
diseases or disabilities. A group of people so classified is said
to suffer from a disease or a disability. Study of such a group may 
lead to generalizations of therapeutic, epidemiologic and pre- 
ventive importance. 

The criteria by which the members of one group are distin- 
guished from those of another depend, of course, on available 
knowledge and techniques. At the moment two rather distinct 
types of criteria are used to delimit one disease or disability from 
another: 

(1) Manifestational criteria. The ill persons are grouped 
according to their similarity with respect to several simultane- 
ously present manifestations of illness such as symptoms, signs, 
changes in body fluids or tissues, physiologic function, thinking, 
behavior, concurrence in time, place or person, or prognosis. Ex- 
amples are fracture of the neck of the femur, diabetes mellitus, 
idiopathic mental retardation, the common cold, Devonshire colic, 
miner’s asthma, and cervical cancer.

(2) Experiential criteria. Here the grouping of the ill per- 
sons depends on similarity with respect to a specified type of ex- 
perience begun at a time preceding the illness, and often said, 
loosely, to be the cause of a disease. Examples of experiential 
criteria in current use are difficult birth, combat, inhalation of 
silica dust, and internalization of the tubercle bacillus. 

Theoretically it is possible that the same group of ill individuals
might be identified through the separate use of either type
of criterion. For example, it is conceivable that the manifesta- 
tions of measles (rubeola) are so obvious and distinctive that the 
group of persons identified on the basis of similarity of symptoms 
and signs might contain just those persons identified separately 
on the basis of encountering the measles virus for the first time. 

In most instances, however, such a correspondence following 
separate application of the two types of criteria would be unexpected.
Figure 3, representing an array of ill persons, is a simple
model intended to illustrate this concept. The square areas M1,
M2 . . . M9 are groups of ill persons, the members of one group
having been distinguished from those of another only by manifestational
criteria. The shaded area, C1, represents ill persons
identified separately on the basis of a specified experience. It may
be noted that the correspondence is not perfect: ill persons included
by the experiential criterion, although mainly belonging
to M1, are also derived from four other manifestational groups.


Figure 3. Model for classification of ill persons according to manifestational
(unshaded) and experiential (shaded) criteria.

This lack of complete correspondence seems to result from two 
facts: (1) a single specified experience (selected from the entire 
causal web) usually has polymorphous effects; and (2) a particu- 
lar manifestation at times arises from diverse experience. For ex- 
ample, some pedestrians injured by an automobile may present a 
fractured tibia, others a simple laceration of the skin, and so on; 
but among all persons with a fractured tibia only a fraction have 
experienced a blow from an automobile. 
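
The imperfect correspondence between the two types of criteria can be shown with a small cross-classification in Python. The five records below are invented for illustration, and the names RECORDS and persons_with are hypothetical; only the two categories "blow by automobile" and "fractured tibia" come from the example above.

```python
# Hypothetical records of ill persons: (person, experience, manifestation).
RECORDS = [
    ("p1", "blow by automobile", "fractured tibia"),
    ("p2", "blow by automobile", "laceration of skin"),
    ("p3", "fall from ladder", "fractured tibia"),
    ("p4", "skiing accident", "fractured tibia"),
    ("p5", "blow by automobile", "fractured tibia"),
]


def persons_with(records, field, value):
    """Set of persons whose record has the given value in the given field
    (field 1 = experience, field 2 = manifestation)."""
    return {rec[0] for rec in records if rec[field] == value}


# Group defined by an experiential criterion.
experiential = persons_with(RECORDS, 1, "blow by automobile")
# Group defined by a manifestational criterion.
manifestational = persons_with(RECORDS, 2, "fractured tibia")
# Persons who fall into both groups.
both = experiential & manifestational

print(sorted(experiential))     # ['p1', 'p2', 'p5']
print(sorted(manifestational))  # ['p1', 'p3', 'p4', 'p5']
print(sorted(both))             # ['p1', 'p5']
```

Neither group contains the other: one person struck by an automobile has no fracture, and two persons with a fractured tibia were never struck. Which grouping is treated as the disease entity depends on the purpose in mind.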

The groups identified by either type of criterion may properly 
be regarded as entities of disease or disability. It should be borne 
in mind, however, that in both cases the entities are not the tan- 
gible objects (ill persons) from which they were derived but are 
abstract concepts whose limits may be revised at will, depending 
on the particular purpose in mind. In the example just given the 
entity “fractured tibia” is particularly useful to the orthopedic 
surgeon in devising therapeutic measures, while the entity “blow 
by automobile” is more appropriate to the public health worker 
interested in designing preventive measures. 


Descriptive Epidemiologic Observations as Manifestational 
Criteria 


The setting of the limits of a disease entity on the basis of 
manifestations is often a highly complex process, and an intuitive 
one. An underlying assumption governing it is that the greater 
the number and kind of similarities (descriptive of each individ- 
ual), the more properly the illness of these individuals may be 
termed an entity; conversely, striking dissimilarities are taken as 
evidence against the illness being an entity. Attention here is 
confined to the role played by epidemiologic observations in this 
process. 

Concurrence in respect to characteristics of time or place or 
person is one of the kinds of similarity commonly used in con- 
structing a disease entity. Numerous examples of this use of de- 
scriptive epidemiology may be found. Caverly, describing an out- 
break of paralytic disease occurring in Vermont in the summer of 
1894, included in his arguments for its being a separate disease 
(poliomyelitis) the season of the year during which the illnesses
occurred; the aggregation of colic at Devonshire helped to delin- 
eate a special type of colic from colic in general (Baker, 1767); 
and the alternation of anencephaly and spina bifida within the 
same sibship, plus the fact that both manifestations are noted 
more among females than among males, provided a reasonable 
basis for considering them as one entity rather than two (Mac- 
Mahon et al., 1953). 

Epidemiologic dissimilarity, on the other hand, may be used as 
an argument in favor of dividing one entity into two. For ex- 
ample, Fogh-Andersen (1942) has advanced the opinion that 
harelip (with or without cleft palate) is an entity to be distin- 
guished from cleft palate (without harelip), because the former 
is primarily an affection of males and the latter of females. 


A Special Class of Experiential Criterion and an Analogy 


A special class of experiential criterion, internalization of mi- 
croorganism, has been so valuable to both treatment and preven- 
tion that it deserves study as a model which might have relevance 
to other classes of experiential criterion. There is no obvious rea- 
son, for example, why such another class might not be internali- 
zation of social stress. 

Five aspects of the application of internalization of microor- 
ganism as an experiential criterion appear pertinent to generali- 
zations about the use of experiential criteria in delineating 
disease entities. 

(1) The disease entity so derived bears by definition a sine 
qua non relationship to the criterion as a result of the criterion 
having been selected (by man, for his own purposes) from an 
infinite causal complex. 

(2) Evidence indicating causal association between internal- 
ization of microorganism and morbid processes, although re- 
quired, does not necessarily have to be watertight. A series of 
conditions which it would be desirable to fulfill in order to estab- 
lish such causal connection and frequently referred to as Koch’s 
postulates are itemized by Topley and Wilson (1936) as follows: 

(a) The microorganism should be found in all cases of the 
disease in question, and its distribution in the body should be in 
accordance with the lesions observed. 

(b) The microorganism should be cultivated outside the
body of the host, in pure culture, for several generations. 

(c) The organism so isolated should reproduce the dis- 
ease in other susceptible animals. 

The comments of Topley and Wilson on technical difficulties 
which may hinder fulfilling all Koch’s postulates in every instance 
are illuminating. In practice not infrequently the evidence ob- 
tained appears to be incomplete by Koch’s standards. Further, 
Koch’s third postulate, if indeed it refers to “other susceptible 
animals” as stated by Topley and Wilson, in reality hedges about 
establishment of causal association. Man in fact would be the animal
necessary for a more convincing demonstration of causal connection
between his disease and a microorganism.

(3) The phrase in Koch’s first postulate, “the disease in
question,” implies that an existing manifestational entity be necessarily
suspected of having causal relationship with a microorganism.
Possibly this is true in some instances. However, extraordinary
revisions of the limits of previously acceptable disease
entities have commonly followed identification of microorganisms
which have deleterious effects on man. The revisions which followed
identification of the tubercle bacillus by Koch (1882) offer
a classic example. The disease entity constructed around this
specified experiential criterion, internalization of the tubercle
bacillus, removed parts from many manifestational entities found
widely scattered among the major categories of disease in the
nosologies of preceding times. For example, Cullen’s classification
(1785) consisted of the four manifestational categories indicated
in Figure 4. Each of the four no doubt contained, in differing proportions,
ill persons whom today it is found useful to classify as
tuberculous (shaded portion of the diagram). It would appear
that the limits of the “disease in question” may be determined
with clarity only after the experiential criterion has been specifically
defined.


Figure 4. Classification of persons with tuberculosis according to manifestational
(unshaded) and experiential (shaded) criteria.

(4) Evidence has not been required that internalization of 
microorganism be invariably disabling. In Figure 5 an array of 
ill persons is presented as in Figure 3. Here, however, the model 
includes non-ill persons in the surrounding area. It is seen that 
non-ill as well as ill persons may satisfy the selected experiential 
criterion. The size of the non-ill group may be larger than that of 
the ill. For example, we presently acknowledge that only a frac- 
tion of all persons who internalize the tubercle bacillus become 
ill as a consequence. 

Figure 5. Model for classification of ill and non-ill persons according to manifestational
(unshaded) and experiential (shaded) criteria.


(5) The “necessary” experience may be an experiential
complex. Two types of microorganism may be required. Swine
influenza, for example, depends on the simultaneous presence of 
bacillus and virus (Shope, 1931). Or the requirement may em- 
brace a non-microorganismal component in conjunction with a 
microorganismal one. In the guinea pig intratracheal introduction 
of silica suspension and dead tubercle bacilli produces massive 
nodular lesions resembling those of silicotuberculosis in man, 
while neither agent alone excites obvious disease reaction in the 
experimental animal (Kettle, 1934). 

Mention was made at the outset of this section that classes of 
experiential criterion other than internalization of microorgan- 
ism might be useful in delineating experiential disease entities, 
and that another such class might be internalization of social 
stress. A hypothetical example is mother-death during infancy. It 
may be noted that this hypothetical criterion is eminently identi- 
fiable in all cases, perhaps with a precision greater than that with 
which microorganisms may be identified. The possibility that 
such a class of experience may become internalized in a manner 
analogous to internalization of microorganism has been pointed 
out by Wiedorn (1955). The “disease in question,” as finally
delineated, might include parts of such manifestational entities 
as mental retardation, schizophrenic reactions, personality disorders,
etc., just as parts of the pyrexiae, locales, neuroses and cachexiae
of Cullen became tuberculosis. Distribution of the experience
“in the body . . . in accordance with the lesions observed”
is entirely compatible with current ideas that an experience of the 
nature exemplified is stored in the brain and that the brain is the 
mediator of perception, thinking, emotion and behavior. Evi- 
dence suggesting causal relationship between loss of parent and 
morbid processes has been demonstrated by numerous case his- 
tory studies, but how often the experience is disabling has yet to 
be determined. 


Some Postulates for Constructing an Experiential Disease 
Entity 


In connection with the introduction of experiential classifica- 
tions in place of existing manifestational categories, the follow- 
ing seem pertinent: 

(1) The manifestations must be of such a nature that causal 
association with the specified experience seems probable, even
though the manifestations may seem diverse in that they embrace 
portions of several accepted manifestational entities. 

(2) The experience must be regularly followed by unde- 
sirable effects on health in a substantial proportion of those un- 
dergoing it and the undesirable effects must not be outweighed 
by desirable ones. 

(3) The experiential entity constructed must have promise 
of utility. 

In what respects must construction of an experiential disease 
entity have promise of utility? When the experience becomes in- 
ternalized and continues in time within the recipient, the advan- 
tages to the therapist should be obvious. The group of ill individ- 
uals at hand have illnesses highly dependent on a common 
experience, and the experience (whether internalization of a par- 
ticular microorganism or of stress from mother-death during in- 
fancy of the patient) becomes a focus for special therapeutic at- 
tention. Antibiotics or specific psychotherapeutic techniques may 
be developed. In addition, there are advantages to the public 
health worker interested in prevention. Means may be devised to 
prevent exposure of the infant to the microorganism or the infant 
may be provided with fortification such that an inevitable inter- 
nalization of the microorganism has a minimum of ill effects; sim- 
ilarly, means may be devised, if not to prevent loss of mother, to 
give infants so deprived supportive measures sufficient to reduce 
substantially the likelihood of subsequent ill effects. 

Finally, lest the impression be left that experiential classifica- 
tions have some virtue not shared by manifestational ones, it 
should again be noted that the same group of ill persons may be 
arranged to create entities of disease according to either mani- 
festations or causal experiences or both, depending on the particu- 
lar purpose in mind. The word “entity” has come to have such a 
deceptively comforting sound in current medical usage that 
J. S. Mill’s remarks (1862) seem as pertinent to the medical pro- 
fession as to the logicians: “Entity, a piece of barbarous Latin, 
invented by the schoolmen to be used as an abstract name, in 
which class its grammatical form would seem to place it; but be- 
ing seized by logicians in distress to stop a leak in their terminol- 
ogy, it has ever since been used as a concrete name.” 


4 
The Strategy of Epidemiology 


The predominant purpose of epidemiology was described in 
Chapters 1 and 2 as the search for causal association between diseases
or other biologic processes and specific environmental experiences.
The methods used to further this purpose may be classified
as follows:

(1) Descriptive epidemiology — description of the distribution
of disease, with comparisons of its frequency in different populations
and in different segments of the same population.

(2) Formulation of hypotheses — tentative theories designed
to explain the observed distribution of the disease in terms
of causal associations of the most direct nature possible. 

(3) Analytic epidemiology — observational studies designed
specifically to examine the hypotheses developed as a result
of the descriptive study.

(4) Experimental epidemiology — experimental studies on 
human populations to test in a stringent manner those hypotheses 
that stand the test of observational analytic studies. 

If epidemiology operated independently of knowledge gained 
in other biologic and medical disciplines, these methods of in- 
vestigation might be expected to proceed in the order listed. 
However, according to the contribution made by other disciplines, 
particularly by clinical medicine and laboratory experiment, any 
one, none, or all of these methods may be used in epidemiologic
investigations of disease. 

The investigation of certain diseases has followed these steps in 
the order listed. An example is the investigation of cholera. The
descriptive phase of the investigation resulted in observations that 
cholera spread along the lines of human communication, that its 
progress was never faster than the speed of human travel, that 
persons coming into close contact with cholera patients fre- 
quently contracted the disease, that its occurrence was connected 
with poverty and overcrowding, and that it was associated with 
refuse and filth. These facts, plus clinical and pathologic observa- 
tions, led Snow (1855) and others to the hypothesis that the dis- 
ease occurred when a healthy person swallowed specific cellular 
agents discharged from the bowel of a cholera patient. The series
of analytic studies that Snow conducted between 1848 and 1854 
to test this hypothesis are classics of epidemiologic investigation 
and will be referred to frequently. The experimental phase of the 
cholera investigations consisted of attempts to keep drinking wa- 
ter free of contamination by sewage; in spite of the haphazard 
nature of these experiments and the absence of controlled studies 
in the modern sense, it was demonstrated that if these efforts were 
successful, freedom from cholera resulted. 

The investigation of the association between regular use of 
drinking water containing greater than average concentrations of 
fluoride and low rates of dental caries has also followed this se- 
quence — from early demonstration of geographic patterns of 
dental caries and of association of mottled enamel with low caries 
rates to later experiments involving the addition of sodium fluo- 
ride to the drinking water of selected populations. 

In other areas the gamut of investigation has not yet been fully 
run. For example, descriptive observations of lung cancer re- 
vealed two striking features: (1) the disease is several times more 
common in men than in women, and (2) its increase during the 
present century has been striking. These features considered in 
connection with clinical and pathologic knowledge led to the 
hypothesis that cigarette smoking is causally related to lung can- 
cer. Many analytic studies designed to test this hypothesis have 
uniformly supported it. The nature of the environmental influ- 
ence implicated in this instance is such that an experimental test 
has not been conducted in man, and in fact may not be practicable.
However, the results of the observational analytic studies are
so compelling that there is little question as to the outcome of 
such an experiment, were it to be conducted. 

For many other diseases the hypotheses that have been de- 
veloped as a result of substantial descriptive data have not re- 
ceived adequate analytic tests. Hypotheses linking cancer of the 
cervix and circumcision of the marital partner, cancer of the 
esophagus and alcohol, cerebral palsy and birth trauma, and heart 
disease and diet may be considered in this category. 

In other diseases specific hypotheses have not yet been de- 
veloped, either because of difficulty in assembling descriptive 
data on a disease, or because the descriptive variables that are or- 
dinarily measured are not related to the causal factors involved. 


DESCRIPTIVE EPIDEMIOLOGY 


A logical inquiry into the nature of an unknown quantity must 
open with questions of a general nature. This is true whether the 
unknown quantity is a specific environmental situation associated 
with development of a particular disease or is an unseen object to 
be identified by a maximum of twenty questions in a parlor game. 
Thus, in the latter case, when items of great rarity, such as the 
mummified left great toe of Ramses II, must be identified, the 
chance of success through unguided guesses is remote. More 
profitably, the inquiry opens with general limiting questions; for 
example, is it animal, vegetable or mineral? If satisfactory an- 
swers are received to these questions (and in the present example 
they may not be), the inquiry may proceed with similar general 
questions as to its age, color, consistency, and so on. Eventually, 
the number of possibilities compatible with the answers resulting 
from these general questions may be sufficiently limited for spe- 
cific guesses. 

Like experts in the parlor game, epidemiologists have developed 
their own set of general limiting questions. The epidemiologist is 
limited because his questions are not so easily answered as those 
in the parlor game; consequently, the value of an answer to a 
general question must be weighed against the expense and time 
involved in deriving it. Since the collection of data is expensive, 
the descriptive phase of epidemiologic investigation is heavily
influenced by what information is available, and the variables most
frequently examined are those on which information is already at 
hand in such sources as vital records, hospital records, and pub- 
lished reports. 

The variables most commonly examined have been classified as 
descriptive of time, place and person. They include: 

(1) Characteristics describing the time in which persons
were found affected:
    Year
    Season
    Day
(2) Characteristics describing the place in which persons
were found affected:
    Country
    State, or district within countries
    Urban or rural
    Local community
(3) Characteristics describing the persons affected:
    Age, sex and ethnic group
    Occupation, education, and socio-economic status
    Marital status
    Family history

Many descriptive variables do not fit neatly into this classifica- 
tion. For example, order of birth and maternal age might be con- 
sidered as properly belonging to all three — person, place and 
time. 

The variables listed are those things that are measured, and are 
not necessarily those which have direct causal connection to the 
disease being examined. Thus an association of a disease with a 
particular place might well be a circumstance of the particular 
persons that inhabit that place, and an association with a variable 
that is descriptive of a person, for example, his race, might equally 
well be the result of factors peculiar to the place of residence of 
that race. The classification should be thought of as a form of mne- 
monic rather than as an attempt at meaningful categorization. 


FORMULATION OF HYPOTHESES


In epidemiology, as in science in general, the thought processes 
involved in the formulation of hypotheses have not been clearly 
formalized and classified. Yet a new and convincing hypothesis 
can be one of the most powerful forces influencing the direction of 
future research, and the success or failure of that research fre- 
quently depends on the soundness of the hypothesis. 

The most common hypotheses in epidemiology seek to identify 
causes sufficient to explain the distribution of a disease noted in 
its descriptive study. Four methods of reaching hypotheses are 
discussed. * 


The Method of Difference 


If the frequency of a disease is markedly different under two 
different circumstances and some factor can be identified in one 
circumstance that is absent in the other, this factor, or its absence, 
may be a cause of the disease. 

The difficulty with this method of formulating epidemiologic 
hypotheses is usually not one of being unable to identify a fac- 
tor that is present in one circumstance and not in the other (or 
relatively so), but rather of the multiplicity of hypotheses which 
may often be suggested on the basis of the difference. For exam- 
ple, literally thousands of possibilities may be suggested to explain 
differences in disease frequency between males and females, or 
between the United States and India, or between certain age 
groups. For this reason, any one descriptive observation, even
though revealing a marked characteristic of the disease under
study, is seldom sufficient for the establishment of a sound
hypothesis. Occasionally one feature of difference between two
circumstances is sufficiently striking to favor it as a likely basis
for hypothesis, but this situation is unusual. For example, although
carcinoma of the cervix is one of the commonest forms of cancer in
females, it is extremely rare in nuns. The differences between nuns
and other females are not so numerous as to make unworthy of
further investigation the hypothesis that some aspect of the
reproductive process predisposes to carcinoma of the cervix.


* The first three methods are patterned after three of Mill’s five canons of
induction (1862), which dealt with the definition and proof of causation, rather
than the formulation of hypotheses.


The Method of Agreement 


If a factor is common to a number of different circumstances 
that have been found to be positively associated with the disease 
under study, this factor may be a cause of the disease. 

For example, cholera is associated with patient contact, with 
overcrowding, and with sewage-contaminated water supply. A 
common factor is the opportunity to ingest fecal discharges from 
cholera patients. This may be a cause of cholera. Again, lung can- 
cer is more common among men than women and was more com- 
mon in 1950 than in 1900. Cigarette smoking, among other things, 
shares the same distribution and consequently may be a cause of 
lung cancer. 


The Method of Concomitant Variation 


This method involves the search for some mechanism whose fre- 
quency or strength varies with the frequency of the disease. It is a 
quantitative rather than a dichotomous way of looking at the 
same ideas that are involved in the methods of agreement and dif- 
ference. A well-known example is the attempt to relate the relative 
frequencies of various dietary constituents to the incidence of 
coronary heart disease in different areas of the world. Another is 
the comparison of the fluoride content of drinking water in vari- 
ous areas to the frequency of dental caries in the inhabitants. 
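
As a rough illustration of concomitant variation, the sketch below
computes a correlation coefficient between a graded exposure and the
frequency of a disease. The community figures are invented purely for
illustration and come from no study, and the function name pearson_r is
our own. A coefficient close to -1 would indicate that caries frequency
falls steadily as fluoride content rises, which is a basis for a
hypothesis and not a proof of causation.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical data: fluoride content of drinking water (parts per million)
# and decayed teeth per 100 children in five imaginary communities.
fluoride_ppm = [0.1, 0.5, 1.0, 1.5, 2.0]
caries_per_100 = [800, 650, 420, 330, 280]

r = pearson_r(fluoride_ppm, caries_per_100)
print(f"correlation between fluoride content and caries: {r:.2f}")
```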


The Method of Analogy 


The distribution of a disease may be sufficiently similar to that 
of some other disease that has been more completely and success- 
fully investigated to suggest that certain causes may be common 
to the two diseases. This method of arriving at hypotheses is re- 
lated to the process of deduction, whereby derived epidemiologic 
principles are applied to specific situations. Thus we tend to as- 
sociate diseases occurring in certain places during the summer 
months with vector transmission, and we are reluctant to accept 
diseases as infectious that do not spread rapidly. Even in the 
absence of general laws or principles, however, this method 
may be of service. For example, the observation that the age and 
sex pattern for pulmonary tuberculosis in adults had features
similar to those of pulmonary cancer and the knowledge that lung 
cancer is related to smoking led Lowe (1956) to test, with some 
success, the hypothesis that smoking may also be a factor in the 
breakdown of pulmonary tuberculosis in later life. 

The method of analogy, while useful, is also treacherous; con- 
siderable errors have been made as the result of false analogies. 
Thus, because genetically determined diseases tend to run in 
families, there has been a tendency until quite recently to assume 
that familial concentration of a disease is of itself sufficient evi- 
dence of genetic determination. This view is of course demon- 
strably false. Similarly, too close analogy from experimental pro- 
duction of congenital malformations to hypotheses concerning 
their etiology in man has led to many hypotheses of doubtful 
value. 


Evaluation of Hypotheses 


Having discussed the most common methods of arriving at 
epidemiologic hypotheses, we may note some principles concern- 
ing them, based on consideration of those hypotheses that have 
been most successful in the past. 

(1) The value of a hypothesis is inversely related to the 
number of acceptable alternatives. The number of acceptable al- 
ternatives again depends on a variety of circumstances: 

(a) In the use of the methods of agreement or difference, 
the greater the number of circumstances in which the suspected
factor is present or absent and in which the disease frequency is 
correspondingly increased or decreased, the fewer the number of 
acceptable alternatives. As noted, an association with one variable 
does not usually lead to the development of a convincing hypo- 
thesis. Association with two independent variables may narrow 
the field considerably. For example, relatively few hypotheses ex- 
plain both the high incidence of cancer of cervix in married 
women and the low incidence in nuns, although either one of 
these associations might have multiple explanations. When yet a 
third observation is added — for example, the low incidence of 
cervical cancer among Jewish women — the possibilities are re- 
duced even further. 


(b) If two variables are both found to be associated with 
a disease, the closer the association between the two variables
themselves, the less valuable is the information regarding their as- 
sociation with the disease. For example, many factors may be con- 
ceived that are common to one of the Census Bureau’s “profes- 
sional” occupations and residence in a favored residential district. 
Any disease associated with one of these variables will almost al- 
ways be associated with the other. Much more limited in number 
are factors common to holding a professional occupation and, 
say, singleness, since the correlation between occupation and 
marital status is weak. 

(c) Associations with certain variables, for example, occu- 
pation and religion, may be of greater value than those with oth- 
ers, such as age and sex, since the specific environmental circum- 
stances associated with the former may be fewer than those as- 
sociated with the latter. 

(2) It is not necessary that the suspected mechanism be 
present in all the circumstances under which the disease fre- 
quency is found increased. Inconsistencies between what is ac- 
tually found and the expectation of what will be found based on a 
hypothesis may be due to: 

(a) Multiple causes. As discussed in Chapter 2, our con- 
cern is with a cause of a disease, not the cause. Causes other than 
the one suspected may be responsible for those observations that 
do not fit into the hypothesis. For example, not all people who
ingest cholera-contaminated sewage develop cholera. This in no way
detracts from the hypothesis linking sewage and cholera, since it 
is quite certain that there are differences between individuals in 
the degree to which they are susceptible to the organism. 

(b) Crudity of the disease classification used. For exam- 
ple, nonsmokers develop lung cancer (although much less often 
than smokers). At first sight this is inconsistent with the hypothe- 
sis linking smoking and lung cancer. However, subsequent investi- 
gations have shown that certain uncommon pathologic varieties of 
lung cancer are not related to cigarette smoking. 

(c) Errors of observation. 

(3) New hypotheses are commonly formed by relating ob- 
servations from several different fields. In the area of disease 
causation, epidemiologic findings are most profitably viewed in 
the light of clinical, pathologic and laboratory observations. Thus
Snow interpreted the epidemiologic observations of cholera in the
light of clinical and pathologic observations that the intestine was 
the apparent focus of the disease. 

(4) The stronger a statistical association, the more useful is 
it as a basis for a hypothesis. “Strength” means not the degree of 
statistical significance, but the degree to which the situation of a 
disease being entirely absent in one circumstance and invariably 
present in another is approached. Associations of low strength, 
however significant statistically, are rarely productive of strong 
hypotheses. 

(5) Change in the frequency of a specific disease over time, 
although very difficult to substantiate, is one of the variables that is 
most productive of hypotheses. This is particularly true of changes 
that occur over a relatively short period. For example, in London 
in the early nineteenth century, the water was supplied by a num- 
ber of different private companies. Snow (1855) observed that in 
1849 death rates from cholera were high in those areas in which 
the water was supplied by the Southwark and Vauxhall Company 
and the Lambeth Company. Five years later, in 1854, the number
of deaths in the area supplied by the Southwark and Vauxhall
Company was somewhat higher, but there was a substantial
reduction of deaths in the Lambeth Company area (Table 1).
There was no large-scale population shift that might have
accounted for the change. It was attributed by Snow to the fact that
in the intervening years the Lambeth Company had changed its
sources of supply to the higher reaches of the Thames River,
above, Snow supposed, the greatest sources of contamination.


TABLE 1

Mortality from cholera in the areas of London supplied by the Southwark
and Vauxhall, and Lambeth Water Companies in 1849 and 1854 *

                                      Number of deaths
Districts with water supplied by      attributed to cholera
                                      1849        1854

Southwark and Vauxhall Company        2261        2458
Lambeth Company                        162          37
Both companies                        3905        2047

* From Snow, 1855.

The attractive feature of this hypothesis is the relatively few 
factors that can be found to have undergone rapid changes
coincident with the observed change in disease frequency.

(6) An isolated or unusual case should receive particular 
attention. Such cases, for example, in persons living in communi- 
ties otherwise free of the disease, have been of great value in the 
infectious diseases and figure prominently in the investigations of 
Snow on cholera and in those of Panum on measles (1846). 
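
Principle (4) above distinguishes the strength of an association from
its statistical significance. The following sketch contrasts the two
under stated assumptions: the counts are hypothetical, the ratio of
risks is used here as one convenient stand-in for "strength," and the
ordinary chi-square statistic for a fourfold table stands in for
"significance" (a value above 3.84 corresponds to the conventional 5
per cent level with one degree of freedom). The function names are our
own.

```python
def risk_ratio(a, b, c, d):
    """Risk among the exposed (a affected, b not) divided by
    risk among the unexposed (c affected, d not)."""
    return (a / (a + b)) / (c / (c + d))


def chi_square(a, b, c, d):
    """Chi-square statistic (no continuity correction) for a fourfold table."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Hypothetical counts, in the order a, b, c, d.
weak_but_large = (1100, 98_900, 1000, 99_000)  # very large study, small difference
strong_but_small = (20, 80, 2, 98)             # small study, large difference

for label, counts in [("weak, large study", weak_but_large),
                      ("strong, small study", strong_but_small)]:
    print(f"{label}: risk ratio = {risk_ratio(*counts):.1f}, "
          f"chi-square = {chi_square(*counts):.1f}")
```

In this invented example both associations pass the conventional
significance level, yet only the second approaches the situation of a
disease nearly absent in one circumstance and common in the other.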


ANALYTIC EPIDEMIOLOGY 


The testing of a hypothesis may in part be carried out by further
descriptive observations — observations that were not available
at the time the hypothesis was formulated. For example, if a
relationship between smoking and lung cancer is hypothesized on 
the basis of the unusual sex ratio and the increasing incidence 
over time, one could further inquire whether lung cancer is most 
frequent in those countries where cigarette smoking is most preva- 
lent, or whether lung cancer is relatively infrequent among occu- 
pational, ethnic or religious groups whose members smoke little or 
not at all. Wynder and Lemon (1958) examined the incidence of 
various cancers and of coronary heart disease among a group of 
Seventh-day Adventists. The group experienced only 10 per cent 
of the expected incidence of lung cancer. Successful prediction of 
associations based on a hypothesis is usually considered stronger 
evidence in favor of the hypothesis than is knowledge of the same 
associations at the time of the formulation of the hypothesis. 
However, this type of evidence is indirect, and is usually capable 
of alternative explanations. More direct methods of testing the 
hypothesis are usually sought. 

The most satisfactory evidence is that needed to test arguments 
of the type “If the difference between two populations in disease 
frequency is dependent on differences between the populations 
with respect to a specific factor, then, within each of the popula- 
tions, this factor should be found more frequently in people with 
the disease than in those without.” For example, if the difference 
between males and females in the frequency of lung cancer is the 
result of heavier smoking by males compared to females, then 
higher rates of lung cancer should be found among males and 
females who smoke than among those who do not. 

The data required to test such an inference are rarely available
in the common sources from which descriptive observations are 
made, and special studies directed specifically to obtaining the 
data are usually required. A classic example of such a study is 
Snow’s investigations of cholera in London. As already noted, 
Snow formed the hypothesis that the frequency of cholera in vari- 
ous geographic sections of London was related to the sources of 
water supply. He was particularly interested in the Southwark 
and Vauxhall Company which drew its water from the section of 
the Thames River most heavily contaminated with sewage, and 
the Lambeth Company which in 1854 drew water from the river 
above the reaches of greatest contamination. As shown in Table 2,
the areas of London supplied entirely by the Southwark and
Vauxhall Company experienced in the summer of 1854 a rate of
5.0 deaths from cholera per 1000 population, whereas the death
rate in the areas supplied by the Lambeth Company alone was
only 0.9 per 1000. A large area supplied by both companies
experienced a rate of 2.2 deaths per 1000, a rate intermediate between
those for the areas supplied by either of the two companies alone.
Clearly these observations are consistent with the hypothesis that
persons drinking water supplied by the Southwark and Vauxhall
Company had greater risks of cholera than those drinking
Lambeth Company water. However, there may be many factors other
than their water supply that differed between these geographic
areas in a way similar to that in which the cholera rates varied.


TABLE 2

Mortality from cholera in the districts of London supplied by the Southwark
and Vauxhall, and the Lambeth Companies, July 8 to August 26, 1854 *

Districts with water              Population   Deaths from   Cholera death rate
supplied by                       1851         cholera       per 1000 population

Southwark and Vauxhall Company    167,654      844           5.0
Lambeth Company                    19,133       18           0.9
Both companies                    300,149      652           2.2

* From Snow, 1855.

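
The death rates quoted from Table 2 are simply deaths divided by
population, scaled to 1000 persons. A minimal sketch of the arithmetic
follows; the populations and deaths are those printed in Table 2, while
the function and variable names are our own.

```python
def rate_per_1000(deaths, population):
    """Deaths per 1000 population."""
    return 1000 * deaths / population


# (population in 1851, cholera deaths July 8 to August 26, 1854), from Table 2
table_2 = {
    "Southwark and Vauxhall Company": (167_654, 844),
    "Lambeth Company": (19_133, 18),
    "Both companies": (300_149, 652),
}

rates = {name: rate_per_1000(deaths, population)
         for name, (population, deaths) in table_2.items()}

for name, rate in rates.items():
    print(f"{name}: {rate:.1f} deaths per 1000")
```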
A more critical test of this hypothesis is provided by an examination
of the data for the area in which the water of both com-
panies was available. If the water supply were related to the risk 
of cholera death, then, within the area supplied by both com- 
panies, those persons supplied by the Southwark and Vauxhall 
Company should have a higher death rate than those supplied by 
the Lambeth Company. This test would be particularly critical 
because, as Snow pointed out, within this area the two companies 
supplied persons of similar economic, occupational and age status. 
Frequently, adjacent houses were supplied by different compa- 
nies, and often the persons supplied were themselves unaware of 
the source of their supply. 

However, while information was available on the number of 
houses supplied by each company, no data were recorded on the 
source of water used by each cholera patient; consequently, spe- 
cial investigation was required. Snow inquired of relatives and 
others as to which company supplied water to each house in 
which a death from cholera occurred between July 8 and August 
26, 1854. The results are shown in Table 3. Within the area supplied
by both companies, the cholera death rates for customers of
each company were very similar to those of the same company’s
customers in the areas supplied exclusively by that company.
Moreover, Snow pointed out, the death rate for customers of the
Lambeth Company was no higher than that for the rest of London,
even though the majority of the Lambeth Company’s customers
were located in the area supplied also by the Southwark
and Vauxhall Company — an area in which the epidemic raged
severely. The hypothesis that the drinking of water supplied by
the Southwark and Vauxhall Company was associated with death
from cholera was therefore confirmed.


TABLE 3

Mortality from cholera in London, July 8 to August 26, 1854, related
to source of individual water supply in three groups of districts *

Group of districts    Water supply of      Population   Deaths    Cholera death-rate
with water            individual houses    1851         from      per 1000
supplied by                                             cholera   population

Southwark and         Southwark and
Vauxhall Company      Vauxhall Company       167,654†     738     4.4
Lambeth Company       Lambeth Company         19,133†       4     0.2
Both companies        Southwark and
                      Vauxhall Company        98,862      419     4.2
                      Lambeth Company        154,615       80     0.5
Rest of London        —                    1,921,972    1,422     0.7

* From Snow, 1855.
† Overestimated by a small amount, since this figure includes population with no
water supply.
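
The critical comparison is the one made within the area supplied by
both companies. The sketch below recomputes the two death rates from
the Table 3 figures for that area and expresses them as a ratio. The
ratio of rates is our own summary of the contrast and is not a figure
given by Snow; the function and variable names are likewise ours.

```python
def rate_per_1000(deaths, population):
    """Deaths per 1000 population."""
    return 1000 * deaths / population


# Table 3, districts supplied by both companies, classified by the
# water supply of the individual house.
southwark_vauxhall = rate_per_1000(deaths=419, population=98_862)
lambeth = rate_per_1000(deaths=80, population=154_615)

rate_ratio = southwark_vauxhall / lambeth

print(f"Southwark and Vauxhall customers: {southwark_vauxhall:.1f} per 1000")
print(f"Lambeth customers:                {lambeth:.1f} per 1000")
print(f"ratio of the two rates:           {rate_ratio:.1f}")
```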

Of course, there is no clear-cut dividing line between descrip- 
tive and analytic epidemiology. Many studies contain elements of 
each, and there is often great difficulty in deciding
whether a “specific” hypothesis is being tested or not. For example,
examination of the geographic distribution of a disease might
be considered a test of the hypothesis that the disease does or does 
not exhibit geographic variation in incidence. Nevertheless, in 
practice, most studies can be categorized as being directed prima- 
rily toward either formulation or testing of rather specific hy- 
potheses. 


Types of Analytic Study 


To illustrate the general forms that analytic epidemiologic 
studies may take, let us suppose that a study is to be under- 
taken to determine whether an association exists between maternal 
infection with rubella and congenital malformations in the
offspring. With respect to the two variables (rubella and
malformation) every individual must fall into one of the cells in the
following fourfold table, the frequencies being indicated by the
letters a, b, c and d:


Maternal rubella          Malformation in offspring
during pregnancy          Present       Absent        Total

Present                   a             b             a + b
Absent                    c             d             c + d
Total                     a + c         b + d         a + b + c + d


The question at issue is whether malformation is more com- 
mon in the offspring of mothers who have had rubella during 
pregnancy than in the offspring of mothers who have not. In other
words, one wishes to compare:


$\frac{a}{a+b}$ with $\frac{c}{c+d}$

There are two general ways in which such a comparison can 
be made. The most direct would be to take two groups of women, 
one of which (a + b) had and one of which (c + d) did not have 
rubella during pregnancy, and determine how many of each 
group showed malformation in the offspring. This method is 
based on the primary selection of an exposed and an unexposed 
population, and it provides a direct measure of the risk of being 
affected according to whether the suspected causal factor was pres- 
ent or absent. 
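
A minimal sketch of this first method follows, using the cell labels of
the fourfold table. The counts are hypothetical and chosen only to make
the arithmetic easy to follow; the class and method names are our own.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FourfoldTable:
    a: int  # rubella present, malformation present
    b: int  # rubella present, malformation absent
    c: int  # rubella absent, malformation present
    d: int  # rubella absent, malformation absent

    def risk_exposed(self):
        """Proportion affected among the exposed: a / (a + b)."""
        return self.a / (self.a + self.b)

    def risk_unexposed(self):
        """Proportion affected among the unexposed: c / (c + d)."""
        return self.c / (self.c + self.d)


# Hypothetical counts: 100 exposed and 1000 unexposed pregnancies.
table = FourfoldTable(a=15, b=85, c=30, d=970)

print(f"risk among exposed:   {table.risk_exposed():.3f}")
print(f"risk among unexposed: {table.risk_unexposed():.3f}")
```

Because the groups (a + b) and (c + d) are selected first, both risks
are measured directly.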

A second method would be to investigate the frequency with 
which rubella had occurred among the mothers of malformed 
and not malformed children. In this method the primary selection 
is of groups of affected (a + c) and unaffected (b + d) persons, 
with subsequent inquiry as to the frequency of the suspected 
cause in the two groups. This enables a comparison of: 


$\frac{a}{a+c}$ with $\frac{b}{b+d}$

If $\frac{a}{a+c}$ is greater than $\frac{b}{b+d}$, then $\frac{a}{a+b}$ must be
greater than $\frac{c}{c+d}$. However, except in unusual circumstances (see Chapter
14), the values (a + b) and (c + d) are not determined in this
type of study, and direct estimates are not obtained of the extent 
to which the risk of bearing malformed offspring among women 
exposed to rubella during pregnancy exceeds that of nonexposed 
women. 
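
The step from one inequality to the other can be written out. Assuming
all four cell frequencies are positive, each comparison reduces to the
same condition on the cross-products ad and bc:

```latex
\begin{align*}
\frac{a}{a+c} > \frac{b}{b+d}
  &\iff a(b+d) > b(a+c) \\
  &\iff ad > bc \\
  &\iff a(c+d) > c(a+b) \\
  &\iff \frac{a}{a+b} > \frac{c}{c+d}
\end{align*}
```

The direction of the association is therefore the same whichever way
the table is read, even though the second method does not yield the
sizes of the two risks.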

These two methods of investigation are commonly described as 
prospective (primary sampling of exposed and not exposed) and 
retrospective (primary sampling of affected and not affected), re- 
spectively. The rationale of this terminology is that the retrospec- 
tive method involves “looking backward’ from effects to 
preceding causes and the prospective method involves “looking 
forward” from supposed causes to subsequent effects. White and 
Bailar (1956) have pointed out that this terminology gives rise to 
considerable confusion, since the terms retrospective and prospective
have everyday meanings quite different from those implied in
their application to these special methods of study. In the general 
sense, a retrospective study is one based on past data or past events, 
whereas a prospective study is one planned to observe events that 
have not yet occurred. To some extent there is overlap between 
the general and epidemiologic meanings of these terms. Thus,
retrospective epidemiologic studies are usually, though not al- 
ways, retrospective in the general sense. However, prospective 
epidemiologic studies may depend on data already accumulated 
or on the occurrence of events subsequent to the primary sampling 
of the population. The method of prospective investigation using 
past data is an economical and satisfactory one and is being used 
increasingly. For example, Case and Lea (1955), to examine in 
1955 the relationship between mustard gas poisoning and lung 
cancer, selected a group of men who in 1930 were receiving 
pensions for the effects of mustard gas poisoning received during 
World War I, and examined the mortality experience of this 
group between 1930 and 1952. 

In order to avoid confusion of terms we will refer to studies in 
which the primary selection is of affected and nonaffected persons 
as “case history studies,” and to studies in which the primary selection
is of exposed and nonexposed groups as “cohort studies.”
“Cohort studies,” a term already in common use, indicates the 
investigation over time of an identified group or cohort of individuals.
These terms are, in our opinion, clear and descriptive of
the methods involved. 

The two methods have quite distinct areas of usefulness as 
well as their individual difficulties, which are discussed in some 
detail in Chapter 13 (Cohort Studies) and Chapter 14 (Case 
History Studies). It may be useful here, however, to compare 
these uses and difficulties in general terms. 

Cohort studies have two major advantages over case history 
studies: 

(1) Direct estimates of the risk associated with the suspected 
causal factor are obtained. Such estimates may sometimes be ob- 
tainable from case history studies, but the estimates will not be 
direct and will only be possible if the ratio of affected to unaf- 
fected individuals in the population is known. This ratio is not 
usually known in a case history study, since a sample of the unaffected
population rather than the total is commonly used and the
sampling fraction is often unknown. 

(2) Spurious relationships resulting from bias in data-col- 
lecting procedures are somewhat less likely in cohort than in case 
history studies. For example, in case history studies, data on the 
frequency of past exposure to the suspected cause are frequently 
based on the patients’ memories of such exposures. Yet affected 
individuals (e.g., the mothers of malformed infants) may be more 
highly stimulated to remember past experiences than more fortu- 
nate individuals. Even when more reliable sources of information 
are used, the possibility of more thorough investigation of the af- 
fected than of the unaffected individuals is a constant problem in 
case history studies. In cohort studies, on the other hand, the 
division of the population into exposed and unexposed groups 
may be made without knowledge of, or even commonly prior to, 
the appearance of the end result, thus reducing considerably the 
possibility of this type of error. 
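
The first of these advantages can be illustrated numerically. The
sketch below assumes a hypothetical population and a case history study
that enrolls every affected person but only one in every k of the
unaffected; the counts, the sampling scheme and the function names are
all our own assumptions. The apparent "risk" a / (a + b) computed from
such a sample changes with the sampling fraction and so cannot be read
as a risk, whereas the frequency of exposure among the unaffected does
not change.

```python
def case_history_counts(a, b, c, d, one_in_k):
    """Expected cell counts when all affected persons (a, c) are enrolled
    but only one in every k unaffected persons (b, d) is sampled."""
    return a, b // one_in_k, c, d // one_in_k


def apparent_risk_exposed(a, b, c, d):
    """a / (a + b): a true risk only when the whole exposed group is counted."""
    return a / (a + b)


def exposure_among_unaffected(a, b, c, d):
    """b / (b + d): proportion of unaffected persons who were exposed."""
    return b / (b + d)


# Hypothetical population: 10,040 exposed (40 affected) and
# 90,060 unexposed (60 affected).
population = (40, 10_000, 60, 90_000)
print(f"true risk among exposed: {apparent_risk_exposed(*population):.4f}")

for k in (10, 100):
    sample = case_history_counts(*population, one_in_k=k)
    print(f"1 in {k} unaffected sampled: "
          f"apparent a/(a+b) = {apparent_risk_exposed(*sample):.4f}, "
          f"exposure among unaffected = {exposure_among_unaffected(*sample):.2f}")
```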

Cohort studies have one considerable disadvantage. By com- 
parison with case history studies they are generally laborious, 
time-consuming and expensive. Related to these characteristics 
is the additional difficulty of assembling in one study sufficient 
data for conclusive answers. This is illustrated in the studies that 
have been undertaken of the association of maternal rubella 
with congenital malformation. Even the combined results of four 
cohort studies (Hill et al., 1958) give only general ideas about 
the magnitude of the risks involved. 

The reciprocal advantages of case history studies are that they 
are relatively quick and inexpensive, they are easily repeatable, 
and large numbers of cases can be examined economically. There- 
fore, case history studies are commonly used when it is desirable 
to test as rapidly and as cheaply as possible several of the hypoth- 
eses that may have been offered to explain observed relationships, 
and to determine initially which of the several hypotheses are 
worthy of further exploration. Cohort studies are more frequently 
undertaken when the evidence from case history studies is fairly 
conclusive that a relationship exists, in an effort to obtain more 
detailed and more reliable data. This pattern has been followed in 
a number of investigations in recent years. In the investigations of
the relationship between smoking and lung cancer, for example,
some fifteen case history studies had been published prior to the 
appearance of the first cohort study in 1954 (Doll and Hill, 1954). 

These generalizations about the uses of the two methods of 
study may be modified by a number of characteristics of the dis- 
ease under study. For example: 

(1) The more common the disease or trait to be investi- 
gated, the easier will be a cohort study, since the smaller will be 
the population that need be surveyed to identify a given number 
of cases. Conversely, the rarer the disease, the more difficult be- 
comes a cohort study, and the more desirable a case history 
study. 

(2) The shorter the interval between the suspected cause 
and the end result in a particular individual, the easier will be a 
cohort study. For example, cohort studies of association between 
events in pregnancy and characteristics of the offspring that are 
manifest at birth are greatly facilitated by the relatively short 
follow-up interval needed to record outcome. Much of the objec- 
tion to cohort studies is therefore removed. Conversely, study of the 
relationship between maternal deprivation in infancy and mental 
illness in adult life would be extremely difficult to attempt by the 
cohort method. 

(3) The more objectively the history of the suspected cause 
can be recorded, the more satisfactory will be the case history 
method. 

(4) A strong statistical association between the suspected 
cause and the end result will make either kind of study easier, but 
because of the relatively greater expense of a cohort study (due 
to the necessity of assembling data on large numbers of unaf- 
fected individuals), the benefits of such a strong association will 
be more apparent in this type of study. For example, although 
only 83 cases of leukemia were recorded during 12 years of obser- 
vation of the Hiroshima City population exposed to the atomic 
bomb (Wald, 1958), the relationship between degree of exposure 
and leukemia risk was so strong that it could be demonstrated 
unequivocally even on these numbers. (The leukemia rate for the 
most heavily exposed persons was about sixty times that for those 
lightly exposed.) Obviously, because of the even larger numbers
required, it is difficult to substantiate weak associations by means
of cohort studies. 
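
Point (1) above can be put in rough numerical terms. The sketch below uses hypothetical risks and a helper name of our own choosing; it simply computes how many persons would have to be followed for the expected number of cases to reach a chosen target, using integer arithmetic to avoid rounding surprises.

```python
def cohort_size_needed(cases_wanted, cases_per_unit, unit_of_population):
    """Persons to follow so that the expected number of cases reaches
    cases_wanted, when risk over the follow-up period is expressed as
    cases_per_unit per unit_of_population. Rounds up to a whole person."""
    needed = cases_wanted * unit_of_population
    return (needed + cases_per_unit - 1) // cases_per_unit


# Hypothetical common disease: risk of 50 per 1,000 during follow-up.
common = cohort_size_needed(100, 50, 1_000)

# Hypothetical rare disease: risk of 1 per 10,000 during follow-up.
rare = cohort_size_needed(100, 1, 10_000)

print(common, rare)
```

Under these assumed figures the rare disease requires a cohort five hundred times larger for the same yield of cases, which is the sense in which rarity favors the case history method.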


EXPERIMENTAL EPIDEMIOLOGY 


Up to this point we have been discussing methods of determin- 
ing whether a statistical association exists between a certain en- 
vironmental situation and a certain disease. To a variable extent 
inference may be possible as to whether a demonstrated associa- 
tion is causal or not. However, as indicated in Chapter 2, proof of 
the causal nature of an association depends on experiment — the 
demonstration that alteration of the effect follows alteration of 
the reputed cause. 

Because of the extensive writings of Greenwood, Topley and
Wilson (see, for example, Greenwood et al., 1936), the expression
“experimental epidemiology” has been used to describe
studies of the mechanisms of spread of infectious diseases in lab- 
oratory animals. However, the meaning of the expression may 
be broadened to include experimental studies of hypotheses of 
disease causation in man. 

In a way, the application of any public health measure that is 
based on knowledge of disease etiology may be considered an ex- 
periment, since it is a test of the validity of that knowledge. Ex- 
amples of such experiments range from the trials of various 
foods as antiscorbutics during the seventeenth and eighteenth 
centuries to the chronic disease control programs that are under 
trial at the present time. Epidemiologic observations are usually 
required to test the success of such measures. 

A more limited concept of experimental epidemiology is that 
it comprises studies or programs specifically designed and oper- 
ated to test etiologic hypotheses. Such studies differ from pro- 
grams that are set up primarily for the purpose of providing serv- 
ice in that control or comparison groups are usually provided for 
in the former but not in the latter. Whenever feasible it is de- 
sirable that experimental programs precede service programs. 
When this does not happen, there is the danger that unrelated 
events following the service program may be misinterpreted as 
evidence for or against the validity of the knowledge on which it
is based. A downward trend in disease incidence may be wrongly 
interpreted as a consequence of the service program when it was 
due to other coincident changes, or a stationary incidence may be 
wrongly interpreted as evidence of the lack of effect of the service 
program if other factors were productive of increasing incidence 
during the same period. Once a therapeutic or preventive prac- 
tice has achieved acceptance, whether on sound or unsound 
grounds, it is very difficult to introduce experimental tests if
indicated at a later date.

Such considerations have led to a widening use of formal ex- 
perimental designs in the preliminary stages of both therapeutic 
and preventive procedures. In the preventive field, such studies 
have been designed to test the efficacy of various immunizing 
agents in the prevention of infectious diseases, to test the associa- 
tion of dietary deficiencies with specific disorders, and, most re- 
cently, to test the causal nature of the association between dental 
caries and fluoride-deficient water supplies. 


Measurements of Disease Frequency


Statements of the frequency of a given trait or disease manifesta- 
tion in various populations are essential to descriptive epidemi- 
ology. Such statements permit comparisons between populations 
and between subgroups of a population with respect to the mani- 
festation in question. In this chapter, the several measures of fre- 
quency in present use are described. Our concern is with the 
definition of these measures and with the theoretical concepts 
underlying them. Practical considerations that limit their accuracy 
are discussed in later chapters. 

In simplest terms a statement of frequency might be, “There
were 500 cases of tuberculosis,” the only requirement being a
count of patients with tuberculosis. However, the statement has
little utility until it is further qualified with respect to two
essential features: (1) in what population and (2) when were these cases
observed. Thus, the statement would gain needed precision if it
read, “On January 1, 1960, in the population of this town there
were 500 cases of tuberculosis.” For administrative purposes, such
a measure of frequency might be entirely satisfactory. For example, 
information in this form would facilitate the planning of medical 
facilities for the tuberculous patient in that particular town, since 
such planning basically requires only knowledge of the number of 
cases and not necessarily of the probability of contracting
tuberculosis.

If, however, the purpose of the frequency statement is to
determine whether one town contains mechanisms of greater or less
strength in the production of tuberculosis than another, such a 
statement of frequency has obvious shortcomings. The second 
town may have more or fewer cases than the first because its pop- 
ulation is larger or smaller. To allow for differences in popula- 
tion size, the frequencies must be expressed in the form of rates. 


RATES 


A rate is a “quantity or degree of a thing measured per unit of 
something else” (Webster). In epidemiologic usage the “thing 
measured” is the trait or disease manifestation while the “unit of 
something else” is the population (diseased and healthy) to which 
the observed cases belong. The time at or during which the cases 
(trait or disease) and population are studied is a further
specification which is essential for epidemiologic purposes. Thus, the
tuberculosis frequency rate in a town on January 1, 1960 might be
expressed as 500 cases per two million population (the actual popu- 
lation of the town). In practice, to enable rapid comparison with 
other rates, rates are expressed not in relation to total populations 
but in relation to certain convenient units of population size — 
usually some power of ten. The tuberculosis rate in the town 
might then be expressed as .025 per cent, .25 per thousand, or 
250 per million. Such rates are derived by dividing the number 
of cases by the actual population and multiplying by the chosen 
unit of size. 
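
The arithmetic just described is easily sketched. The short Python fragment below is an illustration only; the function name `rate_per` is ours, and the figures are those of the tuberculosis example in the text.

```python
def rate_per(cases, population, unit):
    """Number of cases divided by the actual population, multiplied by the
    chosen unit of population size (100, 1,000, 1,000,000, ...)."""
    return cases / population * unit


# 500 cases in a town of two million, as in the example above.
per_cent = rate_per(500, 2_000_000, 100)
per_thousand = rate_per(500, 2_000_000, 1_000)
per_million = rate_per(500, 2_000_000, 1_000_000)

print(per_cent, per_thousand, per_million)
```

The three results are the same rate stated against different units of population; the choice among them is one of convenience in comparison.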

It is evident, therefore, that three items of information are 
necessary in epidemiology for the expression of any rate — the 
numerator of the fraction (the number of persons affected), the 
denominator (the population among whom the affected persons 
are observed), and a specification of time. The denominator is 
commonly called the related or reference population. When the 
denominator is restricted solely to those persons who are capable 
of having or contracting the disease, it may properly be considered 
as a population at risk; the rates or probability statements thus 
derived are of particular pertinence to an understanding of eti- 
ology. 

Both the numerator and denominator of a rate must be
defined in similar terms; if the numerator is restricted to certain
age, sex or racial groups, the denominator must be similarly re- 
stricted, and vice versa. For example, if the denominator popula- 
tion is the number of residents of a town, cases appearing in the 
numerator should also be limited to residents of that town. Sim- 
ilarly, if the numerator consists of the number of cases of a dis- 
ease limited to females (e.g., uterine disease), the denominator 
would necessarily be the related female population. 

The denominator of a rate may not be a “population” in the 
ordinary demographic sense. For example, a hospital may express 
its maternal mortality rate as so much per thousand deliveries. 
Although the women delivered do not form a “population” in 
the general sense, they may still be referred to as the related pop- 
ulation and in this case as a population at risk, since the statement 
derived describes the probability of death among a defined group 
at risk. Similarly, the case fatality rate is the number of deaths 
from that disease per so many patients with the disease; here the 
diseased group constitutes the population at risk. 

Ratios. Epidemiologic rates are in fact statements of probability, 
in the statistical sense. Certain measures, which in general usage 
might be called rates, are not probability statements, and are of- 
ten called ratios. One common form of ratio is the expression of 
the number of affected persons as a proportion of the number of 
unaffected persons (rather than as a proportion of the affected 
plus unaffected persons). For example in the United States it is 
common to express the number of fetal deaths as a proportion 
of the number of related livebirths. This is called the fetal death 
ratio and is distinguished from the fetal death rate which is the 
number of fetal deaths as a proportion of the related total births 
(livebirths plus fetal deaths). Ratios, although satisfactory for 
simple descriptive purposes, do not share with rates the advantage 
of being immediately amenable to treatment as probability
statistics.
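
The distinction between the fetal death ratio and the fetal death rate may be made concrete with a small sketch. The counts used below are hypothetical, and the function names are ours; only the two definitions are taken from the text.

```python
def fetal_death_ratio(fetal_deaths, livebirths, unit=1_000):
    """Fetal deaths per `unit` livebirths (affected relative to unaffected)."""
    return fetal_deaths / livebirths * unit


def fetal_death_rate(fetal_deaths, livebirths, unit=1_000):
    """Fetal deaths per `unit` total births (livebirths plus fetal deaths)."""
    return fetal_deaths / (livebirths + fetal_deaths) * unit


# Hypothetical year: 20 fetal deaths and 980 livebirths.
ratio = fetal_death_ratio(20, 980)
rate = fetal_death_rate(20, 980)

print(round(ratio, 2), round(rate, 2))
```

Only the second figure is a probability statement, since its denominator contains every birth that could have ended as a fetal death; the ratio is always the larger of the two.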


DISEASE ACCOUNTING 


The amount of disease existing at a given point in time, like 
the level of a bank account at the end of a month, depends on 
the size and number of prior credit and debit transactions.
Unlike a bank balance, unfortunately, the disease balance is never
found to be negative. But procedures similar to those descriptive 
of bank accounts are applicable in explaining disease levels, the 
difference being limited to the terms used in naming the amount 
on hand and the various forms of transaction (addition and re- 
moval). 


Point Prevalence (Amount on Hand) 


Point prevalence of a disease is its frequency at a given moment 
in time. It is a measure of what prevails or exists, and corresponds 
to the current bank balance. Expressed as a rate for a specified 
population, point prevalence is the proportion of that population 
who exhibit the disease at a particular instant. The numerator in- 
cludes all persons having the disease at the given moment, irrespec- 
tive of the length of time elapsed from the beginning of the illness 
to the time when the point prevalence is measured. The denomi- 
nator is the population among whom the disease is ascertained. 
Although prevalence is sometimes stated in the form of period* 
prevalence, our use of the word generally is restricted to point 
prevalence. 


Incidence (Addition) 


The amount of disease on hand at a given moment (point
prevalence) is the result of an interplay of dynamic forces. One of
these forces is the incidence: the frequency of addition, or
transaction into the disease account. In order to specify the
frequency of such additions it is necessary to indicate the time
during which cases are added.

The incidence rate of a disease is defined as the number of cases 
of a disease that come into being during a specified period among 
a specified unit of population. 

In practice it is often impossible to measure incidence directly,
since definite timing of an onset of illness is commonly uncertain.
Usually certain arbitrary points in the disease process are used as
approximations. Thus, measurements such as onset of symptoms,
time of diagnosis, and date of notification or of hospitalization
are judged to be events related to how many cases arise within a
specified period of time. Incidence rates, it should be noted, are
measures of the frequency of events, as distinct from the
existences measured by prevalence rates.

* The number of cases that were observed during (rather than at) a specified time,
for example during a year. Period prevalence, therefore, consists of the point
prevalence at the beginning of a specified period of time plus all cases that arise during
the period. Period prevalence is a particularly complex measure, since both point
prevalence and incidence are incorporated in it.

There are only two methods by which additions may be made
to the disease account of a population: by the onset of illness
(either an initial illness or a recurrence), and by immigration of
persons already ill into the population.


Recovery or Death (Removal) 


The other dynamic force determining point prevalence is the 
removal from the disease account. There are three general ways 
by which the disease account may be depleted. For many diseases 
death is the most frequent means of leaving the population. A 
second frequent way is by recovery, either from the initial or a 
recurrent attack. The third way, emigration of persons with the 
disease, either by observed emigration from the population or 
by loss to observation, may be of importance in illnesses of long 
duration. 

Death. It is necessary to distinguish two rates descriptive of 
death: (1) the disease-specific death or mortality rate, and (2) 
the disease-specific case fatality rate. In both the numerator is 
the same — the number of persons dying of the disease during a 
certain period (see page 68 for discussion of why it is sometimes 
unnecessary to specify a period of time for case fatality rates). 
The denominators of the rates, however, are different. In a 
mortality rate the denominator is the total population within 
which the deaths occurred, while in a case fatality rate it is re- 
stricted to persons with the disease. Thus the mortality rate ex- 
presses the probability that a member of a defined total popula- 
tion will die of the disease, while the case fatality rate expresses 
the probability that a member of a population of affected individ- 
uals will die of the disease. The former rate is most useful to 
epidemiology since it is related, through the case fatality rate, to 
the incidence of the disease; the latter is most useful in describing 
the clinical course of the disease once it is established and in 
evaluating the efficacy of various treatments directed against it. 

Recovery. Ordinarily, the term recovery rate relates the number
of recoveries to the number of persons ill from the disease. It is
an uncommon practice to relate the number of recoveries from a 
disease to the total population. 

Cause of death. In a disease-accounting procedure any indi- 
vidual known to die during the observation period must be with- 
drawn from the account. Such individuals may be divided into 
two groups — those who die because of the disease and those 
who die with the disease from some other cause. Although dis- 
tinction between these groups is unimportant for the basic ac- 
counting procedure, it becomes necessary to separate the two if 
determination of the numerator of mortality and case fatality 
rates is desired, since the concern in theory is with the first group 
only — those who die because of the disease. Two methods are 
used to ascertain this group: 

(1) Judgment, based on review of individual cases. For ex- 
ample, a woman with cancer of the cervix with bladder involve- 
ment who dies in uremic coma without evidence of other patho- 
logic states may be said to have died of cervical cancer, whereas had 
the same woman been killed in an automobile accident she would 
have been said to have died from an accident while suffering 
from cervical cancer. For such a judgment to be valid as an index 
of mortality attributable to cervical cancer requires that women 
with cervical cancer experience no greater mortality from acci- 
dents than do women in general. Such an assumption is of doubt- 
ful validity, even for the example chosen, and is still less certain 
for such associated conditions as diabetes and heart disease, nu- 
tritional defect and infection, and so on. On theoretical grounds, 
the method is unsound since it involves making judgments about 
causal association in individual events. As noted in Chapter 2, 
valid inferences concerning causal association can only be made 
for categories of events of which the individual events are merely 
examples. 

(2) Statistical assessment, based on the subtraction from the 
total number of affected individuals who die of that number who 
might have been expected to die if the general death rate among 
nonaffected individuals of similar age and sex had obtained. Al- 
though theoretically the superior method, this procedure requires 
complete ascertainment of the number of affected individuals at 
risk of death and of all who die, whatever the alleged cause.
Although rarely practicable for deriving mortality rates, this
method often does have applicability in estimating attributable 
case fatality. The statistical considerations in this method have 
been described by Muench (1959). 
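
The subtraction described under (2) may be sketched as follows. This is a minimal illustration, not Muench's procedure: the strata, person-years and death rates are hypothetical, and the function names are ours.

```python
def expected_deaths(strata):
    """Deaths expected among affected persons if the general death rate of
    nonaffected persons of similar age and sex had obtained.

    strata: iterable of (person_years_among_affected, general_death_rate)."""
    return sum(person_years * rate for person_years, rate in strata)


def attributable_deaths(observed_deaths, expected):
    """Observed deaths among the affected, less those expected in any case."""
    return observed_deaths - expected


# Hypothetical age groups: (person-years of affected persons, general
# death rate per person-year among nonaffected persons of that age).
strata = [(200, 0.002), (300, 0.010), (100, 0.050)]

expected = expected_deaths(strata)
excess = attributable_deaths(60, expected)  # 60 observed deaths, hypothetical

print(round(expected, 1), round(excess, 1))
```

The sketch presupposes what the text requires: that all affected persons at risk, and all their deaths whatever the alleged cause, have been counted.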

An important function of death data in epidemiology is to pro- 
vide crude estimates of incidence or prevalence when direct esti- 
mates cannot be made. When death data are put to such use, it 
should be borne in mind that most national and other vital data 
on mortality rates from specific diseases are dependent on
judgments concerning individual cases as described under (1)
above. Since 1948 in the United States and 1940 in Great Britain,
deaths have been tabulated according to the “underlying cause 
of death” as selected by the certifying physician from the various 
pathologic states that might be present at the time of death. 
While grossly inadequate with respect to certain conditions (e.g., 
diabetes), the statistics gained in this way yield highly accurate 
measures of mortality from certain others (e.g., acute leukemia). 
Mortality rates so derived are most satisfactory as indices of either 
incidence or prevalence of conditions which (a) are likely to be 
selected as the underlying cause if present at the time of death, and 
(b) run a sufficiently short course that the probability of affected 
persons dying in conjunction with other reported causes will be 
small. Death from a disease is a particularly close index of inci- 
dence (as contrasted to prevalence) when the disease is uniformly 
fatal. On the other hand, when the probability of a person having 
died from one particular cause is high, the frequency of other man- 
ifestations present at the time of death approximates prevalence. 
A recent example is the use of autopsy material from soldiers killed 
in combat to estimate the prevalence of atherosclerosis (Enos et al., 
1955). 

If the method of statistical assessment is used to derive the 
numerator for an attributable case fatality rate, the number of 
deaths obtained will at times be substantially larger than the 
number which would have been obtained through use of the 
method of individual judgments. This disparity will be least for 
diseases which run a short course and greatest for chronic diseases, 
where, in fact, deaths from “other” causes may far exceed those
which attending physicians would have specified as having direct
connection with the disease in question.


Example of Measures of Frequency


To illustrate some of the measures of frequency, use will be 
made of the data shown in Tables 4 and 5 on cases of leukemia 
that are known to have existed in the white population of Brook- 
lyn, New York, between 1948 and 1952. For the present purpose, 
we are assuming that these represent all the cases of leukemia in 
the white population of the city, which in 1950 numbered
2,525,000. The tables show the accounting of cases for each year in the
period. The total cases existent in each year (column 3) are
derived from those patients alive at the beginning of the year who
had received a diagnosis of leukemia in a previous year (column 
1), plus cases newly detected in that year (column 2). Columns
4, 5 and 6 show the disposition during the year of the cases in
column 3. The number of patients known to be alive at the end
of the year (column 6) is, of course, the same as the number alive
at the beginning of the next year (column 1).


TABLE 4

Data on 417 patients with acute leukemia, Brooklyn, 1948-1952 *

         (1)           (2)           (3)           (4)           (5)           (6)
         Diagnosed     New cases     Total cases   Deaths        Patients      Patients
Year     in a previous diagnosed     existent      in year       lost trace    alive at end
         year          in year       in year                     of in year    of year

1948         7            69            76            54             7            15
1949        15            91           106            86             3            17
1950        17            83           100            73             3            24
1951        24            99           123           101             1            21
1952        21            68            89            81             1             7
Total       84           410           494           395            15            84

* There were 7 patients alive at the beginning of the period whose acute
leukemia had been diagnosed previously. Diagnosis of 410 new cases was made during
the period, for a total of 417 cases known to have existed during the period. Of
this total 395 patients died during the study period, 15 were lost trace of during
the period, and 7 were still alive at the end of the period.
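
The accounting just described can be checked mechanically. The sketch below, whose function and variable names are ours, enters the yearly counts of Table 4 (columns 1, 2, 4 and 5) and recomputes columns 3 and 6, confirming that each closing balance is carried forward as the next opening balance.

```python
# Table 4, acute leukemia: year -> (alive at start of year with prior
# diagnosis, new cases in year, deaths in year, lost trace of in year).
table4 = {
    1948: (7, 69, 54, 7),
    1949: (15, 91, 86, 3),
    1950: (17, 83, 73, 3),
    1951: (24, 99, 101, 1),
    1952: (21, 68, 81, 1),
}


def run_account(rows):
    """Return {year: (total cases existent in year, alive at end of year)}."""
    out = {}
    for year, (at_start, new, deaths, lost) in sorted(rows.items()):
        total = at_start + new          # column 3 = column 1 + column 2
        at_end = total - deaths - lost  # column 6 = column 3 less columns 4 and 5
        out[year] = (total, at_end)
    return out


account = run_account(table4)

# The closing balance of each year should open the following year.
carried = all(account[y][1] == table4[y + 1][0] for y in range(1948, 1952))

print(account, carried)
```

The same function applies unchanged to the chronic leukemia counts of Table 5.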

TABLE 5

Data on 495 patients with chronic leukemia, Brooklyn, 1948-1952 *

         (1)           (2)           (3)           (4)           (5)           (6)
         Diagnosed     New cases     Total cases   Deaths        Patients      Patients
Year     in a previous diagnosed     existent      in year       lost trace    alive at end
         year          in year       in year                     of in year    of year

1948       129            79           208            50             8           150
1949       150            83           233            71             5           157
1950       157            90           247            90             6           151
1951       151            61           212            84             7           121
1952       121            53           174            89             4            81
Total      708           366          1074           384            30           660

* There were 129 patients with chronic leukemia alive at the beginning of the
survey period whose disease had been diagnosed in previous years. Diagnosis of 366
new cases was made during the period, for a total of 495 cases known to have
existed during the period. Of this total, 384 patients died during the study period, 30
were lost trace of, and 81 were still alive at the end of the period.


Prevalence. Patients alive at the beginning of the year whose
disease was diagnosed in a previous year (column 1) are in fact the
cases of leukemia existent at 00.00 a.m. on January 1 of each year.
They form, therefore, estimates of point prevalence. Point
prevalence rates can be derived by relating these numbers to the
population at risk. For acute leukemia the average point
prevalence for these five points in time is:

\[ \frac{84 \times 10^{6}}{5 \times 2{,}525{,}000} = 6.7 \text{ per million.} \]

Column 3 shows the numbers of cases existent during each year.
Average annual period prevalence rates can therefore be derived.
For acute leukemia, the rate is:

\[ \frac{494 \times 10^{6}}{5 \times 2{,}525{,}000} = 39.1 \text{ per million.} \]

An important characteristic of period prevalence rates is
evident in these data. The period prevalence does not bear a direct
relationship to the length of the period of observation. For
example, the 5 year period prevalence (7 + 410 = 417) is less
than five times the average annual period prevalence (494/5 =
98.8). Clearly some patients who would be included only once in
a 5 year prevalence would be included several times in successive
annual period prevalences.

Incidence. Time of diagnosis is used here as the basis for
determining incidence. Incidence rates are derived from column 2.
The 5 year incidence rate for acute leukemia is:

\[ \frac{410 \times 10^{6}}{2{,}525{,}000} = 162.4 \text{ per million per 5 years.} \]

Since incidence rates are usually expressed per annum, this may
be divided by 5 to derive an average annual incidence rate of 32.4
per million per year.

Mortality rate. The mortality rate is derived from column 4.
For acute leukemia the average annual death rate is:

\[ \frac{395 \times 10^{6}}{5 \times 2{,}525{,}000} = 31.2 \text{ per million per year.} \]

This estimate is based on the assumption that all deaths of
affected persons were caused by leukemia. Actually, on the basis of
the age-specific death rates from all causes in the general
population at the time, one of the patients with acute leukemia and 18 of
those with chronic leukemia might have been expected to die
from other causes during the 5 year period. The subtraction of
these cases from the total deaths makes little difference to the
mortality rate, and the assumption seems to be a reasonable one
for this particular disease.

The various rates described in this section are summarized for
both forms of leukemia in Table 6.


TABLE 6

Frequency rates for acute and chronic leukemia derived from the data
in Table 4 and Table 5

Average annual rates             Acute       Chronic
per million population           leukemia    leukemia

Incidence                         32.4        29.0
Death (mortality)                 31.2        30.4
Period prevalence                 39.1        85.1
Point prevalence (not annual)      6.7        56.1
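
The derivation of Table 6 from the column totals of Tables 4 and 5 can be sketched in a few lines. The names below are ours and the fragment is an illustration rather than the authors' own computation. The recomputed values agree with the table to within about 0.1 per million; the small differences appear to arise from rounding in the printed figures.

```python
POPULATION = 2_525_000  # white population of Brooklyn, 1950, as given in the text
YEARS = 5               # 1948 through 1952

# Five-year column totals from Table 4 (acute) and Table 5 (chronic).
totals = {
    "acute": {"alive_at_start": 84, "new": 410, "existent": 494, "deaths": 395},
    "chronic": {"alive_at_start": 708, "new": 366, "existent": 1074, "deaths": 384},
}


def annual_per_million(count):
    """Average annual count per million population over the five years."""
    return count / YEARS / POPULATION * 1_000_000


table6 = {
    form: {
        "incidence": annual_per_million(t["new"]),
        "mortality": annual_per_million(t["deaths"]),
        "period_prevalence": annual_per_million(t["existent"]),
        # Average of the five January 1 counts; a point, not an annual, measure.
        "point_prevalence": annual_per_million(t["alive_at_start"]),
    }
    for form, t in totals.items()
}

for form, rates in table6.items():
    print(form, {name: round(value, 1) for name, value in rates.items()})

# Period prevalence is not additive over time: the 5 year figure for acute
# leukemia (7 + 410) is far below the sum of the five annual figures (494).
five_year_period_prevalence_acute = 7 + 410
```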


INTERRELATION OF INCIDENCE AND PREVALENCE 


There is an important relationship between prevalence and in- 
cidence which may be expressed as follows: 


\[ P_{t} \text{ varies as the product of } I_{(t)} \text{ and } d \]


where $P_{t}$ is point prevalence at the end of the period $(t)$, $I_{(t)}$ is
incidence during time $(t)$, and $d$ is the average duration of the disease
from onset* to termination.

It follows that a change in point prevalence from one period 
to the next may be the result of changes (1) in incidence, (2) in 
duration, or (3) in both incidence and duration. For example, 
improvements in therapy, by preventing death but at the same 
time not producing recovery, may give rise to the apparently 
paradoxical effect of an increase in prevalence of the disease. And 
decrease in prevalence may result not only from decrease in inci- 
dence but also from a shortening of the duration of illness — 
through either more rapid recovery or more rapid death. 
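
The apparently paradoxical effect of improved therapy can be seen in a deliberately simple model of the disease account. The sketch below rests on assumptions of our own: a constant number of new cases each month, and every case lasting exactly the same number of months. The figures are hypothetical.

```python
def simulate_prevalence(new_cases_per_month, duration_months, months):
    """Cases on hand at the end of each month, when a case added in month k
    remains on hand at the end of months k through k + duration_months - 1."""
    prevalence = []
    for month in range(1, months + 1):
        # Only the cohorts added in the most recent `duration_months`
        # months are still in the account.
        active_cohorts = min(month, duration_months)
        prevalence.append(new_cases_per_month * active_cohorts)
    return prevalence


# Hypothetical disease: 10 new cases a month, lasting 6 months.
before = simulate_prevalence(10, 6, 24)[-1]

# Therapy postpones death without producing recovery: duration becomes
# 12 months, while incidence is unchanged.
after = simulate_prevalence(10, 12, 24)[-1]

print(before, after)
```

With incidence held fixed, doubling the duration doubles the amount of disease on hand once the account has settled.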

If incidence and duration are constant over a succession of time 
periods the disease is said to be in equilibrium and: 


\[ P_{t} = P_{t+1} \]


In this circumstance the relationship between prevalence, inci- 
dence and duration is such that: 


\[ P_{t} = I_{(t)} \, d. \]


For a disease in equilibrium, this formula may be used to
calculate the other component of the equation when two are known.
For example, from the estimates of prevalence and incidence of
leukemia in Table 6 the average duration of the disease can be
calculated from the relationship $d = P_{t} / I_{(t)}$:

\[ \text{acute leukemia: } d = \frac{6.7}{32.4} = 0.21 \text{ years} = 2.5 \text{ months.} \]

\[ \text{chronic leukemia: } d = \frac{56.1}{29.0} = 1.93 \text{ years} = 23 \text{ months.} \]


These are close to the values of 2.4 months for acute leukemia
and 20 months for chronic leukemia derived independently
from special follow-up studies of the same patients. These
durations represent the interval between diagnosis and death, since
incidence here is based on date of diagnosis and death is the only
avenue of removal from this disease account (other than
emigration with disease). Note that the formula gives duration in the
same time unit as that used for incidence (in this case, years).

* If incidence is measured from some stage other than onset (e.g., diagnosis),
duration is measured from the same stage. Prevalence must also include only cases
existent at or subsequent to the same point in the disease process.

The above formula is most useful in consideration of diseases 
in which the predominating form of removal is either recovery 
or death. It must be remembered that duration to death in some 
diseases is shorter, and in others longer, than duration to recovery. 
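
The calculation of duration from prevalence and incidence may be sketched as below, using the Table 6 figures for leukemia. The function name is ours, and the sketch assumes, as the text requires, a disease in equilibrium.

```python
def average_duration_years(point_prevalence, annual_incidence):
    """Average duration d = P / I, in the time unit of the incidence rate.
    Both arguments must be stated per the same unit of population."""
    return point_prevalence / annual_incidence


# Table 6 values, per million population.
acute_years = average_duration_years(6.7, 32.4)
chronic_years = average_duration_years(56.1, 29.0)

# Incidence is per year, so the durations come out in years.
acute_months = acute_years * 12
chronic_months = chronic_years * 12

print(round(acute_years, 2), round(acute_months, 1))
print(round(chronic_years, 2), round(chronic_months, 1))
```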

Another useful relationship is that between incidence, $I_{(t)}$, and
mortality, $M_{(t)}$, which may be used to measure the case fatality
rate, $f$:

\[ f = \frac{M_{(t)}}{I_{(t)}} \]

Thus, for acute leukemia (see Table 6) f = 31.2/32.4 = 0.96, and, 
for chronic leukemia, f = 30.4/29.0 = 1.05. The findings are con- 
sistent with the general conception that both forms of leukemia 
are invariably fatal. Calculations dependent on this relationship 
should again be restricted to instances in which (1) the disease is 
in equilibrium, and (2) the reports of death and incidence are 
equally complete. If the disease is not in equilibrium, but inci- 
dence is either increasing or decreasing from one time to the next, 
the ratio mortality/incidence will be lower or higher, respectively, 
than the case fatality. Such a discrepancy becomes larger with in- 
creasing lengths of time from onset to death. 
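The two points made here, the calculation of case fatality and its distortion when incidence is changing, can be illustrated with a rough sketch. The first function reproduces the leukemia figures quoted above. The second is a deliberately simplified model of our own: it assumes incidence changes by a constant proportion each year and that every fatal case dies exactly `lag_years` after onset. Neither assumption comes from the source.

```python
def case_fatality(mortality_rate, incidence_rate):
    """f = M / I; meaningful only at equilibrium with equally complete reports."""
    return mortality_rate / incidence_rate


def mortality_incidence_ratio(growth, lag_years, true_fatality=1.0):
    """Ratio M/I observed in a year when incidence is not in equilibrium.

    Hypothetical model: incidence changes by the fraction `growth` per year,
    and fatal cases die exactly `lag_years` after onset. Deaths this year
    therefore stem from the incidence of `lag_years` ago.
    """
    return true_fatality / (1.0 + growth) ** lag_years


print(round(case_fatality(31.2, 32.4), 2))  # acute leukemia: 0.96
print(round(case_fatality(30.4, 29.0), 2))  # chronic leukemia: 1.05

# Rising incidence pulls the ratio below the true fatality of 1.0,
# and more so as the interval from onset to death lengthens.
for lag in (1, 5, 10):
    print(lag, round(mortality_incidence_ratio(0.05, lag), 3))
```

With falling incidence (a negative `growth`), the same function returns a ratio above the true case fatality.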


MEASUREMENTS PARTICULARLY RELEVANT TO 
EPIDEMIOLOGY 


A variety of rates have been described which are related to the 
frequency of a disease in a population. The use of one rather than 
another is dependent on the relevance of the one selected to the 
particular problem in mind. Where the problem is epidemiologic 
the rate of choice obviously is incidence. 


Incidence 


Attempts at elucidation of causal factors are concerned with an 
event or occurrence (becoming ill), and the measure most directly
descriptive of this event is the incidence rate of the disease.
Causal factors necessarily operate prior to the “onset” of disease. 
Therefore, the closer the stage of disease at which incidence is 
measured comes to the time of actual onset, the more directly “incidence”
will be influenced by the operation of causal factors.
Incidence, potentially, is the most direct measure for comparing 
populations with respect to causal factors. 


Death Rates as Indices of Incidence 


The main reason why death rates are so often used as indices 
of incidence is their ready availability. Routine registration of 
deaths together with specification of ‘cause’ of death enables 
death rates to be derived for many diseases and for many popu- 
lations for which more direct measures of incidence are not avail- 
able. Note should again be taken that death rates reflect the 
characteristics of incidence rates most closely for a particular dis- 
ease when the interval between onset and death is short and when 
the disease in question has a high case fatality rate. 


Prevalence 


For epidemiologic purposes point prevalence measurements are 
inferior to incidence measurements, since two sets of factors de- 
termine point prevalence: those connected with the occurrence of 
cases, and those connected with course and duration (prognosis) 
of the cases once they are established. Although the same factors 
may at times be operative in both mechanisms, there would ap- 
pear to be a much greater chance of correctly assigning the role 
of individual factors if the two processes were studied separately 
by investigation of incidence rates on the one hand, and of prog- 
nosis on the other. When substantial incidence data are scarce, 
point prevalence data at times give clues as to incidence, but the 
contribution of duration to point prevalence must be constantly 
borne in mind. 

On the other hand, many administrative situations require 
knowledge of how many patients with a given disease exist in 
the community. Clearly the answer to such a question rests in 
point prevalence. Incidence rates are of uncertain value since, for 
a given incidence rate, diseases of long duration impose a greater 
community burden than those of short duration. Even here, how- 
ever, incidence rates may be of considerable importance if, as is 
often the case, facilities for new cases differ from those for patients 
who do not recover within a short period of time.

Although period prevalence is less useful epidemiologically
than other measures of disease frequency, it is given attention 
here because of its recurring use in an epidemiologic context. 
Period prevalence is a compound of (1) a past point prevalence 
(the resultant in turn of a previous interrelation between inci- 
dence and duration) and (2) a more recent incidence. It is 
therefore influenced by variation from a large number of factors. 
Furthermore, because there is no direct relationship between pe- 
riod prevalences measured over different periods, period preva- 
lence is a most difficult measure to evaluate. 

Theoretically, period prevalence could be used to estimate in- 
cidence or point prevalence if one or the other of these were 
known: $P_{(t)} = P_t + I_{(t)}$. But in practice if data are available to estimate
period prevalence as well as one of the other measures, they
are usually available for direct calculation of the third measure. 

Occasionally, period prevalence data are used as estimates of in- 
cidence or point prevalence. The extent to which this is justifi- 
able depends on, among other things, the following considera- 
tions: 

(1) The longer the period over which period prevalence is 
determined, the closer period prevalence resembles incidence. 

(2) The shorter the period over which period prevalence is 
determined, the closer period prevalence resembles point preva- 
lence. 

(3) The shorter the average duration of disease, the closer period
prevalence resembles incidence.

(4) The longer the average duration of disease the closer 
period prevalence resembles point prevalence. 
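The four considerations above follow from the composition of period prevalence, and a small sketch may make this concrete. It assumes, beyond what the text states, a disease in equilibrium (so that the starting point prevalence equals incidence times duration) and a constant incidence rate over the period. The function names and the numbers are illustrative only.

```python
def period_prevalence(point_prevalence, incidence_rate, period):
    """Period prevalence = cases existing at the start + new cases in the period."""
    return point_prevalence + incidence_rate * period


def incident_share(incidence_rate, duration, period):
    """Fraction of period prevalence contributed by new cases.

    Assumes equilibrium, so the starting point prevalence is
    incidence_rate * duration.
    """
    existing = incidence_rate * duration
    new_cases = incidence_rate * period
    return new_cases / period_prevalence(existing, incidence_rate, period)


# A longer period or a shorter duration raises the share due to incidence;
# a shorter period or a longer duration leaves mostly pre-existing cases.
for duration, period in [(2, 10), (2, 0.1), (0.2, 1), (5, 1)]:
    share = incident_share(10, duration, period)
    print(f"duration={duration}, period={period}: {share:.2f} from new cases")
```

Under these assumptions the share reduces to period / (duration + period), which makes the direction of each of the four statements easy to see.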


SPECIFICATION OF TIME 


We have seen that an incidence rate measures the frequency 
of events in a certain sized population over a specified period of 
time. In specifying the period over which the events are observed, 
two things must be defined: (1) the point in time at which the 
observation began, and (2) the length of the period of observa- 
tion. In the case of point prevalence rates no duration of observa- 
tion is involved, but it is still necessary to specify the point in time 
at which the observations are made.


Points in Time


So far we have spoken of time as if calendar time were neces- 
sarily implied. Actually there are several ways of specifying time. 

One way is by the specification of a calendar time. If, for 
example, we wished to estimate the incidence of coronary thrombosis
in men aged 45, we could observe all cases of coronary thrombosis
in men aged 45 with onset between January 1 and December
31, 1958, and relate this to the population of 45-year-old men
at the middle of the year, to derive an annual incidence rate. 
Since the date of occurrence of events is very frequently recorded 
and it is consequently relatively simple to decide whether or not 
an event occurred between two calendar dates, this is the most 
common way of specifying the beginning and end of a period of 
observation. 

Similarly, it is common to measure point prevalence at a point 
in calendar time. National censuses, for example, which measure 
the point prevalence of people, are taken as of a specified date. 

A second way to specify time is by using chronologic age of the 
population. In the previous example of coronary thrombosis in 
45-year-old men, an alternative method would be to observe a 
population of men who had just attained their forty-fifth birth- 
day. Since not all men reach their forty-fifth birthday at the same 
time, the periods of observation would not all coincide in calen- 
dar time. 

A common and more practical use of chronologic age is in a 
situation in which there is interest in a disease occurring within 
a short time of birth. For example, in a maternity hospital over a 
certain period of time there may have been 2000 deliveries of 
liveborn children, 22 of whom died in the first week after birth. 
The death rate in the first week of life may then be expressed as 
11 per 1000 livebirths. Not all these children were observed during
the same period in calendar time, and it does not matter for
this purpose whether the 2000 births occurred over a period of 1
year or of 10 years. The period of observation is specified as beginning
at a certain age (birth) and continuing for 7 days thereafter.
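The arithmetic of this example is simple enough to sketch. The helper below is our own naming. It takes only the number of events and the number of births, which reflects the point that the calendar span over which the births occurred does not enter the calculation.

```python
def rate_per_1000(events, population):
    """Events per 1000 members of the denominator population."""
    return events * 1000 / population


# 22 deaths in the first week of life among 2000 liveborn children.
print(rate_per_1000(22, 2000))  # 11.0
```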

In the case of point prevalence data, an example of a similar 
type of measurement might be a statement that by age 10 years 
20 per cent of a certain population are tuberculin positive. Most
commonly, such a measurement would be defined in terms of 
both age and calendar time. Thus the statement might read “In 
1959, at age 10 years, 20 per cent of this population are tuberculin 
positive.” However, derivation of a point prevalence requires 
specification of a point in time, not a period as vague as a year, 
and it is neither practical nor desirable to specify this point in 
terms of both variables (i.e., measuring the tuberculin sensitivity 
of persons all reaching their tenth birthday at the same point in 
time). For purposes of assembling the data, the specification may 
be in terms either of age or of calendar time. Thus according to
the type of specification used, the statement might more precisely
read “Among persons reaching their 10th birthday at some
time in 1959, 20 per cent were tuberculin positive,” or “Among
persons who were ten years old on July 1, 1959, 20 per cent were
tuberculin positive.” The implications of these two statements 
are for all practical purposes the same, but the method of assem- 
bling the data is quite different. 

Occasionally this type of rate is implied in the way a rate is 
expressed, although the actual method of derivation of the rate 
may be on the basis of a specified calendar time. Thus, in the 
previous neonatal mortality example (see page 65), the hospi- 
tal may have been able to observe each of the infants born for a 
period of 7 days and to record deaths occurring in the actual de- 
nominator population. In the case of the much more frequently 
used infant mortality rate, however, this may not be possible, 
since this would necessitate observation of the denominator popu- 
lation for one year. It is usual therefore to express the infant 
mortality rate as: 


$$\text{Infant mortality rate} = \frac{\text{Number of deaths of infants aged less than one year}}{\text{Number of livebirths during the same year}} \times 1000$$

The implicit assumption is that the number of livebirths dur- 
ing the period of observation is the source of all the infant deaths 
occurring during the same period. Since infant deaths are heavily 
concentrated in the early part of the first year of life, this is not 
an unreasonable assumption, particularly when the annual number
of births is relatively constant.

In theory the relationship $P_t = I_{(t)} \times d$ holds whether age or calendar
time is the point of reference. However, the same proviso
holds for age as for calendar time: the disease must be in equilib- 
rium. In the situation where age is the point of reference this 
means that neither incidence nor duration changes appreciably 
with age. In fact we know of no disease for which this is true; 
changes in incidence with age are usually so marked as to make 
calculations from the formula of little value. The general prin- 
ciple, however, that point prevalence is a function of incidence 
and duration remains the same, and point prevalence at a specified 
age depends on: 

(1) Incidence at younger ages. 
(2) Duration. This in turn depends on: 
(a) Continuance of the observed state. 
(b) Mortality of persons manifesting the state relative to 
those not manifesting it. 

An example of this principle is observation of characteristics 
evident at birth, such as congenital malformations. Although 
such observations are frequently expressed as incidence rates 
(e.g., 1.3 cases of harelip per 1000 births), they are in fact point 
prevalence rates, since they are observations of the frequency of 
existence of certain states at a certain age (birth). The events 
which might be used to express true incidence rates (the occur- 
rence of harelip) occurred at least 6 months before birth. What 
such rates are called is of less concern than an understanding that 
they have the characteristics of point prevalence rates in being 
simultaneously influenced by both incidence and duration. In this 
case, since natural recovery from harelip does not occur, the 
most important determinant of duration is survival. Differences 
in the frequency with which congenitally malformed fetuses die 
and are aborted will produce differences in the point prevalence 
of congenital malformations at birth, even though the incidence 
of malformation is the same. 
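The dependence of prevalence at birth on fetal survival can be illustrated with a minimal sketch. All figures here are hypothetical and chosen only to show the direction of the effect. The model assumes a single survival probability to birth for malformed fetuses and another for unaffected ones, which is a simplification of our own.

```python
def prevalence_at_birth(incidence, survival_affected, survival_unaffected):
    """Proportion of births showing the malformation.

    incidence: proportion of conceptions in which the malformation arises.
    survival_*: probability that a fetus of each kind survives to birth.
    """
    affected_births = incidence * survival_affected
    unaffected_births = (1.0 - incidence) * survival_unaffected
    return affected_births / (affected_births + unaffected_births)


# Same incidence of malformation (2 per 1000, hypothetical), but different
# fetal survival in two populations.
high = prevalence_at_birth(0.002, survival_affected=0.9, survival_unaffected=0.95)
low = prevalence_at_birth(0.002, survival_affected=0.5, survival_unaffected=0.95)
print(f"{high * 1000:.2f} vs {low * 1000:.2f} per 1000 births")
```

Only when affected and unaffected fetuses survive equally often does the prevalence at birth equal the incidence of the malformation.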

A third method of specifying the time of onset of an observa- 
tion is according to the appearance of some characteristic or the 
occurrence of some experience that is common to a group of per- 
sons but without clear specification of either age or chronologic 
time. Thus, we speak of the incidence of complications of a certain
disease, that is, the proportion of patients with the disease who
experience complications. Another example is the incidence of
complications of pregnancy. Case fatality rates are similar types of
statements. None of these rates bear much relationship to rates 
used for describing disease frequency in the population at large. 

The prevalence of defects among military inductees or among 
school entrants forms a special class. Here there is a tacit specifica- 
tion of age, and the frequency of defects noted may, in some cir- 
cumstances, be considered representative of the population from 
which the entrants were drawn. 


Lengths of Time 


An incidence rate involves specification of the period of time 
during which events have occurred. The use of arbitrary periods 
such as weeks, months or years enables ready comparison of data 
from several studies. 

However, in measurement of incidence rates of some diseases 
or events the population may be at risk for a limited period of 
time only. In such cases it may be possible to express a “total” in- 
cidence that would remain the same regardless of the length of 
observation. The limitation of the period of risk may be due to 
the action of etiologic influences that exist for only a short period 
of calendar time, or to the restriction of risk to certain age 
groups. For example, this situation frequently exists when inci- 
dence is being measured in relation to the number of births 
(population), since many diseases are limited to infancy. Thus, in
Birmingham, England, between 1940 and 1949 there were 194,216
livebirths, and 578 of these infants developed hypertrophic
pyloric stenosis (MacMahon et al., 1951). The incidence of py- 
loric stenosis may be expressed as 578/194,216 x 1000 or 2.98 
per 1000 livebirths without any specification of the duration of 
observation. Since this condition occurs predominantly in the first 
3 months of life and is practically unknown over 6 months of age, 
a relatively short observation period ensures the derivation of a 
total incidence, which will not be appreciably increased by in- 
creasing the observation period. A similar situation exists with 
respect to the measurement of the frequency of complications of 
pregnancy and of specific acute diseases. In the measurement of 
case fatality rates in chronic diseases it is usually necessary to 
specify a period during which the case fatality or the frequency 
of other complications is being measured. It is common, for example,
to speak of 1 year, 5 year or 10 year survival rates.

It is clearly not possible to derive such a “total” incidence rate
for diseases which can be contracted at any time in life. An ap- 
proach to it is the life time probability of contracting or dying 
from a given disease. This is not an observed rate but is a theoreti- 
cal expectation derived from currently observed incidence or 
death rates in each age group of the population under the as- 
sumption that the currently observed rates will continue to oper- 
ate throughout the life of the population in question. It is nec- 
essary further to assume an arbitrary life span, for example, 70 
years, while a true “total” incidence rate for diseases which can
be contracted at any time in life would be the probability of 
contracting or dying from the disease regardless of how long the 
person lived; as has been noted, this may be determined only 
when the period of risk is limited to a certain period of time or 
age span. 
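A rough sketch of such a lifetime probability follows. It is a simplification and not the method of any particular publication: it applies each age group's annual rate for the years spent in that group, assumes the rates stay fixed over the lifetime, and ignores deaths from other causes. The age groups and rates are hypothetical.

```python
def lifetime_probability(age_specific_rates, life_span=70):
    """Probability of contracting the disease before `life_span` years of age.

    age_specific_rates: list of (start_age, end_age, annual_rate) tuples.
    Simplifying assumptions: current rates persist for life, and no one
    is removed by death from other causes.
    """
    disease_free = 1.0
    for start_age, end_age, annual_rate in age_specific_rates:
        years_in_group = max(0, min(end_age, life_span) - start_age)
        disease_free *= (1.0 - annual_rate) ** years_in_group
    return 1.0 - disease_free


# Hypothetical annual incidence rates by age group.
rates = [(0, 40, 0.0001), (40, 70, 0.001), (70, 90, 0.003)]
print(f"to age 70: {lifetime_probability(rates, 70):.3f}")
print(f"to age 90: {lifetime_probability(rates, 90):.3f}")
```

The result depends on the life span chosen, which illustrates why an arbitrary span must be stated when such a figure is quoted.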

A “total” incidence resulting from limitation of risk to a period 
of calendar time is illustrated by an attack rate. This rate con- 
tains in the numerator all the events occurring during an arbitrary 
period of time usually encompassing some circumstance such as the 
passage of an acute epidemic. 

A secondary attack rate is a measurement in which the numer- 
ator consists of the number of cases of a disease which occur 
within the same household following the occurrence of the first, 
or ‘primary,’ case. It is usually used in connection with infectious 
disease, and there is usually a stated or implied time limitation 
that, on the basis of the known incubation period of the particu- 
lar disease, indicates that the secondary cases probably derived 
from the primary case. The denominator in a secondary attack 
rate is usually the population at risk, rather than the reference 
population, in that it is restricted to groups actually at risk and, 
in diseases conferring prolonged immunity, excludes persons who 
have previously had the disease. 
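A secondary attack rate along these lines might be computed as sketched below. The household records and field names are hypothetical. The denominator follows the description above by excluding the primary case and any members already immune from past disease.

```python
def secondary_attack_rate(households):
    """Secondary cases divided by household contacts actually at risk.

    Each household is a dict with hypothetical keys:
      members         - total persons in the household
      immune          - members who previously had the disease
      secondary_cases - cases following the primary case within the
                        stated time limit
    """
    cases = sum(h["secondary_cases"] for h in households)
    # Exclude the primary case (1 per household) and immune members.
    at_risk = sum(h["members"] - 1 - h["immune"] for h in households)
    return cases / at_risk


households = [
    {"members": 5, "immune": 1, "secondary_cases": 1},
    {"members": 4, "immune": 0, "secondary_cases": 2},
    {"members": 3, "immune": 1, "secondary_cases": 0},
]
print(f"{secondary_attack_rate(households):.2f}")  # 3 cases among 7 at risk
```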


SPECIFICATION OF THE EVENT 


The general type of event — onset of illness — which goes into 
the numerator of incidence rates requires further specification.

If expressed as a rate, incidence (unlike point prevalence) is
not necessarily a fraction of the population. The numerator may 
be made up of cases or attacks rather than individuals, since in 
some illnesses, such as acute alcoholic delirium or the common 
cold, some individuals may experience more than one attack dur- 
ing the period of observation. In such a circumstance the prob- 
ability statement derived measures the risk of cases of the illness 
occurring among the population rather than the risk of individ- 
uals becoming ill. It would seem preferable, for epidemiologic 
purposes, to reduce the numerator to individuals so that a prob- 
ability of individuals having one or more attacks during the 
interval might be expressed. Further subdivision into risks of 
having 1, 2, 3, ... attacks during the period are useful in identifying
and describing groups of individuals who have an abnormal
frequency of attack.
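The distinction between counting attacks and counting individuals can be sketched as follows. The input is assumed to be a list with one person identifier per recorded attack. The identifiers and the population size are hypothetical.

```python
from collections import Counter


def attack_measures(attack_person_ids, population):
    """Return (attacks per person in the population,
               proportion of persons with one or more attacks,
               number of persons having exactly 1, 2, 3, ... attacks)."""
    attacks_per_person = Counter(attack_person_ids)
    attack_rate = len(attack_person_ids) / population
    person_risk = len(attacks_per_person) / population
    distribution = dict(Counter(attacks_per_person.values()))
    return attack_rate, person_risk, distribution


# Six attacks recorded among three persons in a population of 10.
ids = ["a", "b", "a", "c", "a", "b"]
rate, risk, dist = attack_measures(ids, 10)
print(rate, risk, dist)  # 0.6 0.3 {3: 1, 2: 1, 1: 1}
```

The first figure describes cases occurring in the population. The second is the probability of an individual having one or more attacks, which is the measure the text prefers for epidemiologic purposes.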

In addition, although attacked only once during the period of 
interest, some individuals may be experiencing a recurrence of 
such an illness as reactive depression, their initial attack having 
occurred several years before the period of observation. Under 
this condition an incidence rate expresses the risk of individuals 
having an attack during the period of interest, whereas it would 
have been more helpful to an understanding of basic causal 
mechanisms to identify those individuals having their initial 
episode of the illness during the period of interest. A further 
reason for avoiding inclusion of persons having a recurrence is 
particularly applicable to diseases in which recovery occurs in 
only a fraction of the initial episodes. The characteristics of individuals
who have recovered, and therefore are susceptible to a
recurrence, are of necessity related to recoverability. Inclusion of 
such persons in the numerator of an incidence rate would seem 
likely to becloud the features distinctive to persons having their 
first episode. 


SPECIFICATION OF THE POPULATION 


The denominator of incidence rates also presents problems be- 
yond those of identifying its geographic or administrative bound- 
aries that were discussed earlier. If the denominators of incidence 
rates contain people who are not at risk of contracting the disease,
important epidemiologic differences between the rates may
be obscured. 

In those acute infectious diseases which confer permanent im- 
munity the majority of a specified population may not be at risk 
of attack because of their past encounter with the disease. In 
common chronic or relapsing diseases of extremely long average 
total duration, the denominator may in turn contain an appreci- 
able proportion of persons who are either currently ill with the 
disease or have had it in the past and are, therefore, not at risk 
of having an initial attack. 

Ideally an incidence rate, in order to express true “risk,” would 
have its population base confined to those persons who have not 
in the past had and do not presently have the disease under study, 
but this is a refinement which has not been commonly practiced, 
and which in fact may be ignored without serious error unless 
the proportion of such persons in the population is large. 
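The size of the error from an unrefined denominator can be judged with a short sketch. The figures are hypothetical. The parameter `not_at_risk` stands for persons who have the disease now or have had it in the past.

```python
def incidence_rate(new_cases, population, not_at_risk=0):
    """New cases divided by the population at risk of a first attack."""
    at_risk = population - not_at_risk
    if at_risk <= 0:
        raise ValueError("no persons at risk in the denominator")
    return new_cases / at_risk


crude = incidence_rate(50, 10_000)
few_excluded = incidence_rate(50, 10_000, not_at_risk=200)
many_excluded = incidence_rate(50, 10_000, not_at_risk=6_000)
print(crude, round(few_excluded, 5), many_excluded)
```

With 2 per cent of the population not at risk the rate barely moves. With 60 per cent not at risk, as might occur with an infection conferring permanent immunity, the crude rate understates the risk by more than half.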


6 


Sources of Mortality and 
Morbidity Data 


An epidemiologist obviously must depend on observations 
made by many others. One person rarely has the opportunity to 
observe a sufficiently large population (including both affected 
and unaffected individuals) to arrive at firm estimates of disease 
frequency. Implicit in this is the limitation that data constituting 
the principal sources of information are frequently gathered and
published for different purposes than those in which the epidemi- 
ologist is interested, and may consequently not suit his purpose 
completely. 


DEATH REGISTRATION 


The introduction of death registration was the foundation of 
modern epidemiology; it turned the subject from a narrative dis- 
cipline into a science that depends on quantitative arguments for 
deduction and generalization. For an account of the history of 
death registration the reader is referred to a recent publication by 
Dorn (1959a). The primary reasons for recording deaths are of 
course legal and political. 

Deaths reported by cause were the principal reliance of epidemiologic
intelligence in early times. They supplanted the less
specific total death rate as measures of disease prevalence, although
even today changes in the total death rate may give the
first indication of epidemic conditions (see Figure 1, page 6). 
The specification of cause of death has over centuries followed 
developments in medical knowledge and in concepts of manifesta- 
tional and experiential classification; international agreements 
have been established, and more nations are approaching completeness
in reporting. “Completeness” means not only that all
actual deaths are recorded, but also that assigned causes are shifted 
from vague and ambiguous categories, such as “marasmus” and
“senility,” to ones that are as explicit as possible in light of pre- 
vailing knowledge. 

Death registration contributes importantly to knowledge of the 
history of certain diseases through secular trends in mortality. In 
the analysis of such trends, certain difficulties must be considered. 

First, it is necessary to take account of changes in the complete- 
ness of registration of deaths. In the United States, satisfactory 
levels of registration have been achieved by the different states 
at different times. Admission to the “Death Registration Area” 
— the area for which mortality data are published by the Na- 
tional Office of Vital Statistics — has been dependent upon achieve- 
ment of satisfactory levels of registration, and consequently the 
geographic area included in the Death Registration Area has var- 
ied from time to time. At the time of its original formation in 
1880 the Death Registration Area included only two states — 
Massachusetts and New Jersey. The Area became complete, that 
is, comprised all the then existent states, with the inclusion of 
Texas in 1933. Since death rates vary from state to state, as well 
as from time to time, these additions to the Registration Area 
must be considered in the examination of time trends. Trends 
for the years between 1900 and 1933 are most accurately determined
by restriction of data to the ten registration states which
provided acceptable data throughout that period. In England and 
Wales, long-standing traditions and uniformity of registration pro- 
cedures are responsible for satisfactory levels of registration at 
least since the beginning of the present century. 

The successive revisions of the Manual of the International 
Statistical Classification of Diseases, Injuries, and Causes of Death 
(the “International List”), an internationally accepted guide for
classification and coding of causes of death, may also profoundly 
influence the apparent trends of mortality of certain diseases as
tabulated in published statistics. This is particularly true with re- 
spect to the Sixth Revision in 1948. The identification of changes 
attributable to such revisions is usually easily made by noting the 
abruptness of the changes and their coincidence with institution 
of the revisions. Making allowance for changes due to the revi- 
sions is less easy, although there have been special studies of their 
effects on various causes of death (Van Buren, 1940; Dunn and 
Shackley, 1944; World Health Organization, 1952). 

Another source of confusion results from the changes that have 
been made in the procedure for classifying death certificates on 
which the attending physician has noted the presence of more 
than one pathologic condition. Since practically all published 
statistics deal with dead persons as if each had but one cause of 
death, it is necessary to assign each certificate to some one rubric 
of the International List even though the certificate might logi- 
cally satisfy criteria for any of several. For many years it was cus- 
tomary to assign such certificates on the basis of certain arbitrary 
rules established in an internationally accepted Manual of Joint 
Cause of Death; thus, a certificate mentioning both heart disease 
and diabetes mellitus would automatically have been assigned to 
diabetes, even though the certifying physician might have con- 
sidered heart disease the underlying cause of death and diabetes 
only a coincident condition. Following the recommendations of 
various international bodies, Great Britain in 1940 and the United 
States in 1949 began assigning certificates bearing multiple causes 
on the basis of the certifying physician’s opinion as to the “un- 
derlying cause” of death. This change had a profound effect on 
the statistics of some diseases. For example, diabetes mellitus 
showed an abrupt decline of between 30 and 50 per cent in the 
number of assigned deaths. Once again, special studies have been 
made which assist in the interpretation of data that span the 
years of change (Registrar General, 1947; Erhardt and Weiner, 
1950; National Office of Vital Statistics, 1951). 

Finally, the formula $M_{(t)} = I_{(t)} \times f$ (see Chapter 5, page 62) indicates
that changes in mortality may result from change in either
incidence or fatality. This relationship, while contributing to the 
difficulties in the interpretation of mortality trends, may also 
occasionally be put to good use. Thus estimates may be made 
retrospectively either of incidence from known case fatality or of
prevalence from knowledge of both case fatality and duration. 
For example, before the antibiotic era, the prevalence of tuber- 
culosis in a population could be estimated at ten times its mor- 
tality, based on an average 5 years’ duration and 50 per cent case 
fatality for this disease. Currently, this estimate must be modi- 
fied because of lower case fatality, and an average duration still 
to be determined. 
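The tuberculosis estimate combines the two relationships used earlier, mortality = incidence × case fatality and prevalence = incidence × duration, so that prevalence = mortality × duration / case fatality. A minimal sketch, with a function name of our own choosing and the same equilibrium assumption as before:

```python
def prevalence_from_mortality(mortality_rate, duration_years, case_fatality):
    """Point prevalence implied by a mortality rate at equilibrium.

    Incidence is recovered as M / f, then multiplied by duration d.
    """
    incidence = mortality_rate / case_fatality
    return incidence * duration_years


# Pre-antibiotic tuberculosis: 5 years' average duration, 50 per cent fatality.
print(prevalence_from_mortality(1.0, 5, 0.5))  # 10.0 times the mortality rate
```

A lower case fatality with the same duration raises the multiplier, which is one reason the old factor of ten no longer applies.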

The improvement in accuracy and completeness in death regis- 
tration furthers historical studies of disease, but the consequent 
increased complexity of registration procedures has some draw- 
backs in terms of immediate usefulness. Processing, classifying, 
listing, and publishing the contents of death certificates delay the 
distribution of information on current epidemics. Total numbers 
of deaths on the other hand can be made available in a relatively 
short time. The 1952 fog in London, referred to previously 
(page 5), focused attention on the value of “excess total deaths.” 
The United States Public Health Service, National Office of Vital 
Statistics, publishes weekly reports of the number of deaths in 
114 selected cities. During the influenza pandemic in 1957 these 
numbers were plotted and compared with the median number 
of weekly deaths for the preceding 3 years in the same cities 
(Figure 6); the excess mortality was quite strikingly demon- 
strated. 
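The comparison described here, observed weekly deaths against the median for the same weeks in the preceding 3 years, might be sketched as follows. The weekly counts are hypothetical and are not taken from the published reports.

```python
from statistics import median


def excess_deaths(observed, previous_years):
    """Observed weekly deaths minus the median for the same week in earlier years.

    observed:       list of weekly death counts for the current year
    previous_years: list of lists, one per earlier year, aligned week by week
    """
    baseline = [median(week) for week in zip(*previous_years)]
    return [obs - base for obs, base in zip(observed, baseline)]


# Three weeks of hypothetical counts for the preceding 3 years and the current year.
previous = [
    [100, 110, 105],
    [98, 112, 101],
    [104, 108, 103],
]
current = [120, 150, 104]
print(excess_deaths(current, previous))  # [20, 40, 1]
```

The median is less affected than the mean by a single unusual year, which may be why it serves as the baseline in the comparison described above.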


Death Reporting in Limited Populations 


Official death registration has the main purpose of estimating 
mortality in the total population. Deaths are also reported in 
limited portions of the population and these data may be used 
for specific epidemiologic purposes. For example, some insurance 
companies have analyzed in detail deaths occurring among their 
policy holders. Risk of death has been related to attributes such 
as social class, and physical attributes, such as weight, for which 
data are not available in national statistics. 

Reports of deaths among the Armed Forces under noncombat 
conditions have greater than usual accuracy and completeness, 
but naturally pertain to a selected population of limited age and 
sex range.


MORBIDITY REPORTING


Official reporting of disease occurrence occasions many more dif- 
ficulties than death reporting. Apart from certain limited popula- 
tions, such as the Armed Forces, in which there is distinct admin- 
istrative interest in morbidity data, the motivation for reporting 
ordinarily originates from health officials. The routine sources of
morbidity data to which the epidemiologist can turn without 
conducting a special survey include: 

(1) Reporting of certain acute diseases to health authorities.
(2) Registers of certain diseases.
(3) Reports on absenteeism from work or school.
(4) Medical statistics of the Armed Forces.
(5) Hospital reports.
(6) Other sources.


Disease Reporting 


Legislation requiring the reporting of certain diseases at onset 
has the objective of maintaining a current picture of the danger 
which these diseases present to the community. Since a specific 
diagnosis is required, reporting is usually the responsibility of 
practicing physicians. Reports are made to local health authorities 
so that control measures may be initiated with the shortest possible
delay. The selection of diseases to be reported is mostly a
local matter except for six diseases (cholera, plague, louse-borne
relapsing fever, smallpox, louse-borne typhus fever, and yellow
fever), the reporting of which is required by International Sanitary
Regulation. Reportable (or “notifiable”) diseases are usually
either infectious diseases or diseases believed to be occupationally 
determined. 

There are two forms of reporting: (1) reporting of cases and 
(2) reporting of epidemics. Reports of epidemics may be made in 
the event of unusual outbreaks of a disease, whether or not the 
disease in question is included on the regular list of reportable 
diseases, and even, in fact, when the nature of the epidemic is
unknown.

Although the purpose of disease reporting is to obtain inci- 
dence data, generally the number of cases reported is far fewer 
than the total cases occurring. However, valuable inferences can
be derived from the observation of relative fluctuations. For ex- 
ample, the trend of reported cases over several years may be 
assumed to be proportional to the actual incidence trend (provid- 
ing the period is not too long). 

Apart from their use for initiating control measures, the 
value of reports of epidemics lies mainly in forming a picture of 
the geographic distribution of disease. For example, during the 
influenza epidemic of 1957 there was published a weekly map 
that showed the counties of the United States that had reported 
epidemic conditions similar to influenza (Communicable Disease 
Center, 1957). For certain diseases (poliomyelitis, influenza, and
arthropod-borne encephalomyelitis) the Communicable Disease
Center, United States Public Health Service, issues periodic 
surveillance reports based on data collected from all over the 
United States that include many details of unusual completeness. 


Disease Registers 


Some health authorities attempt to maintain a continuous ac- 
counting of patients with diseases of major public health signifi- 
cance, such as pulmonary tuberculosis and cancer, and of patients 
hospitalized for mental disorders. Newly recognized cases are en- 
tered and maintained in the file until recovery, death, or migra- 
tion from the community. Obviously, it takes several years to 
build up such a file to completeness. Potentially, these registers 
are sources of accurate information as to duration of disease, case 
fatality, incidence, prevalence, and other pertinent epidemiologic 
measurements. Official reports from such files are often of an ad- 
ministrative nature, dealing with the medical care and welfare 
assistance to which patients are entitled by local regulations. 
However, the original data and the reports that are occasionally 
published in great detail, such as the report of the experience of 
Connecticut cancer registration between 1935 and 1951 (Gris- 
wold et al., 1955), are valuable sources of epidemiologic informa- 
tion. 

An example of a particularly complete disease register is the 
pulmonary tuberculosis register kept by the Central Tuberculosis 
Dispensary for the City of Copenhagen since early 1930. The 
main entries of the annual reports are compiled in Table 7 for
the years 1949 through 1956. The table demonstrates some of the
purely administrative problems that influence prevalence data,
such as the decision to change the criterion for recovery (freedom
from Mycobacterium tuberculosis) from 5 years to 3 years, and
the changes in procedure with respect to the “lost” cases. Comparison
of lines 5 and 6 indicates the increased fraction of deaths
from “other” causes that can be expected when duration of a
chronic disease is increased. This information was available
because the center was kept informed of all deaths among its
registrants as well as of the causes of death that were reported.


TABLE 7


Itemized accounting of prevalence of pulmonary tuberculosis in a City *


Disease entry                          1949   1950   1951   1952   1953   1954   1955   1956

(1) Cases on file January 1            7423   7272   6853   6425   4761   4275   3745   3413
(2) New cases during the year           [?]    [?]    686    649    572    [?]    437    372
(3) Relapses after 3 yrs. freedom
    from Mycobacterium tuberculosis       -      -      -      -     59     86    110    [?]
(4) Persons with tuberculosis
    entering community                  [?]     86    128     99    [?]     78     66     69
(5) Deaths from tuberculosis            [?]    [?]    [?]    [?]     66     71     61     62
(6) Deaths from other causes             24     35     31     48     38     61     59     68
(7) Regarded as recovered               [?]    [?]    708  1844†    866    727    633    676
(8) Moved to other communities          [?]    [?]    278    279    [?]    [?]    [?]    127
(9) Removed from file because of
    unknown fate‡                        43     91    [?]    [?]     47    150     29     38

Point prevalence (1)                   7423   7272   6853   6425   4761   4275   3745   3413
Incidence (2) + (3) + (4)               991    915    814    748    732    665    615    515
Termination (5) through (9)             [?]    [?]   1242   2412    [?]    [?]    [?]    [?]
Total population (in thousands)         [?]    [?]    [?]    [?]    [?]    [?]    752    746


* Source: annual reports from the Central Tuberculosis Dispensary, Copenhagen,
Denmark.

† In 1952 the criterion for recovery was changed from 5 years to 3 years freedom
from Mycobacterium tuberculosis. In the years following 1952 the report contains
a separate item for cases relapsed after 3 years freedom from M. tuberculosis.

‡ Criterion changed: During the period 1949-1951, meaning no information for
6 years. During the period 1952-1953, meaning no information for 5 years. During
the period 1954-1956, meaning no information for 3 years.
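
The bookkeeping behind such a register can be checked with a short calculation. The sketch below assumes the simple identity that the cases on file on January 1 of one year equal those on file a year earlier, plus incidence, minus terminations. The function name `next_prevalence` is ours, and the figures are the Table 7 entries for 1951 and 1952.

```python
def next_prevalence(on_file_jan1, incidence, terminations):
    """Cases on file on January 1 of the following year."""
    return on_file_jan1 + incidence - terminations


# Entries read from Table 7 (Copenhagen tuberculosis register).
on_file = {1951: 6853, 1952: 6425, 1953: 4761}
incidence = {1951: 814, 1952: 748}
terminations = {1951: 1242, 1952: 2412}

for year in (1951, 1952):
    predicted = next_prevalence(on_file[year], incidence[year], terminations[year])
    # Compare the predicted count with the count actually on file a year later.
    print(year + 1, predicted, predicted == on_file[year + 1])
```

The large number of terminations in 1952 coincides with the change in the criterion for recovery, which is why the fall in prevalence between 1952 and 1953 should not be read as a change in disease frequency.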


Absenteeism 


The proportion of persons absent from work or school on any 
given day is an indication of the total point prevalence of disease 
and disability in the population. These data are not routinely 
available as official reports, but under certain circumstances, such 
as pandemic influenza, reports of total absenteeism can be re- 
quested by the health authorities. Excess absenteeism will then 
yield an important measure of the extent of undue prevalence 
of disease. Specific causes of absenteeism are usually difficult to 
determine. A diagnostic statement is rarely available before the 
worker or the pupil returns, and, even so, may be of questionable 
accuracy. 

Absenteeism may also be used as a measure of the relationship 
between total population morbidity and certain characteristics of 
individuals. For example, characteristics such as overweight and 
physical disability may be related to total number of work days 
lost in a specified period to determine whether such characteris- 
tics are liabilities to health or performance. 


Medical Statistics of the Armed Forces 


The annual reports of the Surgeons-General of the Armed 
Forces contain statistics on morbidity. Although these are limited 
to certain age groups of adult men that are selected because of 
physical fitness, the information is valuable because of its
accuracy.


Hospital Reports 


It would be desirable to place hospital records foremost on the 
list of sources of epidemiologic information. Paradoxically, it ap- 
pears that official reports from hospitals are the least informative 
of all sources. Florence Nightingale is credited with the sugges- 
tion that all hospitals be required to report admissions by disease 
category in order that authorities may have dependable and complete
information of the disease status of the population. This has
never been successfully accomplished, and annual hospital reports 
are usually restricted to administrative data. 

The outstanding exception to this generalization is provided by the data
from hospitals for mental disorders. The availability of detailed 
statistics on mental hospital patients on a local, state and national 
basis is probably related to most such hospitals being under gov- 
ernment control. Statistics are available both in the annual reports 
of specific hospitals, and, more usefully to the epidemiologist, in 
central compilations of data for the state or for the country. In 
some areas, special efforts have been made to include data for 
hospitals not directly under fiscal control of the agency collecting 
such statistics. For example, data for Massachusetts now include 
patients admitted to Veterans Administration and private hospi- 
tals as well as the State facilities. Obviously, without such in- 
clusions, major biases of the data with respect to age, sex and so- 
cio-economic status are unavoidable. Reports published on the 
basis of mental hospital data distinguish between census counts 
of patients under hospital care as of certain dates (prevalence) 
and first admissions of persons not previously in a mental hospital 
(incidence). Again these data will be strongly influenced by ad- 
ministrative decisions as to what constitutes being under hospital 
care (for example, whether this includes all patients on the books, 
or only those physically in the hospital), what is a “first” admission,
and what constitutes a “discharge” from the hospital.


Other Sources 


Particular local circumstances sometimes enable estimates of 
morbidity from specific diseases. For example, the allocation of 
priority supplies of certain foods on the basis of medical certifica- 
tion of illness during the rationing of food in Great Britain has 
enabled estimates of the prevalence of diabetes, thyrotoxicosis, 
tuberculosis, peptic ulcer and other conditions according to age, 
sex and geographic location (Stocks, 1949). 


MORBIDITY SURVEYS 


The routine sources of morbidity data discussed previously 
have mostly been designed to estimate prevalence or incidence by
striving for a complete disease census of the population studied. In
addition, periodic attempts have been made to supplement these 
sources by special surveys of samples of the population. For de- 
scriptive purposes, these special surveys can be grouped in two 
general classes: 

(1) Studies of the frequency of a specific disease or group 
of diseases in the population of a limited geographic area believed
to be generally representative of a larger population.

(2) Special surveys of defined populations with the attempt 
to estimate morbidity from all causes. 

Perhaps the best-known examples of the first group are the 
surveys of cancer morbidity in ten metropolitan areas of the 
United States conducted by the U. S. Public Health Service in 
1937 and 1947 (Dorn and Cutler, 1959). A similar survey has 
been conducted for the State of Iowa (Haenszel et al., 1956), and 
individual investigators have published many reports of less ex- 
tensive surveys, usually covering a smaller geographic area and 
limited to a narrower group of diseases. The technique of such 
surveys is usually to assemble reports of all known cases of the 
disease in question from practicing physicians, hospitals and 
death records. The reports must of course be sifted to exclude 
persons who are not residents of the area under investigation and 
to avoid duplication of reports of the same patient from different 
sources. The variety of sources included and their completeness 
will determine to a considerable extent the accuracy of the survey. 
In a survey of the prevalence of multiple sclerosis in the City of 
Boston between 1939 and 1948, 62 per cent of the cases were 
identified from hospital records, an additional 34 per cent were 
ascertained by inquiry of practicing physicians, and 4 per cent, 
which were not recorded by hospitals or physicians, were traced 
from death certificates (Ipsen, 1950). These proportions will, of 
course, vary with time and place, and with the disease under investigation.
Thus, in the surveys of cancer in the ten metropolitan
areas considerable variation between sites of cancer was noted
in the percentage of cases reported by physicians (as opposed to 
hospitals): from 51 per cent for skin cancer to 15 per cent for 
cancers of the urinary system and of the central nervous system. 

Surveys of those diseases in which it is felt that an appreciable 
proportion of affected individuals do not come to medical attention
often require a special procedure. With such diseases door-to-door
interviewing may be conducted to ascertain previously unknown
cases. This procedure is particularly pertinent to studies
in the area of mental disease, and a number of such investigations 
have been conducted in recent years (Opler, 1958; Leighton, 
1959). For chronic diseases prevalence, but not incidence, is usu- 
ally determined by such surveys. 

The other class of morbidity survey, in which total morbidity 
is to be ascertained, involves the selection of representative sam- 
ples of the population. Since the information required can only 
be obtained directly from individuals constituting the population,
surveys of the total population would be prohibitively
expensive. Generally, information has been obtained by household 
interview, supplemented sometimes by the use of diaries written 
by the families involved. Two surveys undertaken by the Com- 
mission on Chronic Illness, one in Baltimore (1957) and one in 
Hunterdon County, New Jersey (1959), were unusual in that 
medical examinations were also offered to the persons inter- 
viewed. 

Much valuable epidemiologic data as well as information on 
methods were derived from a series of morbidity surveys con- 
ducted in this country during the earlier years of this century, 
mostly under the auspices of the United States Public Health 
Service. These included studies of seven cotton-mill villages in 
South Carolina in 1916 (Sydenstricker et al., 1918), of Hagers- 
town, Maryland, between 1921 and 1923 (Sydenstricker, 1926), 
of 9000 families selected at random from all parts of the United 
States between 1928 and 1931 (Collins, 1933), of over 700,000 
representative households from 1935 to 1936 (Perrott et al., 1939) 
and of the Eastern Health District of Baltimore between 1938 
and 1943 (Downes and Collins, 1940). 

There have been two major continuing surveys on a national 
basis. In England and Wales, the Social Survey, initiated in 1944, 
aimed to interview random samples of the population aged 16 or 
more (Slater, 1946). About 3000 persons were interviewed each 
month, and questioned as to the occurrence of illness during the 
previous 2 months. Routine results were published in Reports of 
the Ministry of Health and in the Quarterly Returns of the Registrar
General, and special reports appeared from time to time
(Stocks, 1949; Registrar General, 1955). The Social Survey was
discontinued in 1952. 

The U. S. National Health Survey, established by Act of Con- 
gress in 1956, began collecting data in July, 1957. The program 
of the survey consists of three parts. The first is the household 
interview survey. Approximately 3000 households, selected in 
such a manner as to be representative of the population of the 
United States, are interviewed each month by trained lay interviewers
(each household is interviewed only once). The sample
is selected in such a way that each week data are obtained from a 
random sample of the population. Thus although the survey is
not designed primarily to demonstrate short term time trends in 
disease frequency, the data may in fact be used very effectively in 
this way, as illustrated in Figure 7 which shows the effect of the 
1957 influenza epidemic. As data accumulate from the house- 
hold interview survey, more complex and more critical tabulations 
and cross tabulations will be possible. The fact that demographic
and personal data as well as health information are sought at interviews
ensures the derivation of the necessary “reference population”
for the computation of rates in relation to variables for
which population data are not presently available.

Figure 7. Weekly incidence and prevalence estimates of bed-disability due to
acute upper respiratory disease in the United States, July 6, 1957 to February 8,
1958. National Health Survey data. (Reproduced with permission from Linder,
1958.)

Second, a series of special studies are planned as part of the 
National Health Survey. Prominent among these studies are plans 
for a continuing survey using medical examinations as well as 
household interviews. 

Lastly, a major part of the National Health Survey program 
consists of method studies designed to understand sources of error 
in the collection of morbidity statistics. It is perhaps not too great 
an exaggeration to say that the National Health Survey offers 
the potential for prompting as great an advance in epidemiologic 
knowledge in this country as did the introduction of death regis- 
tration in London four hundred years ago. 


7 


Characteristics of Persons: Age 


Age is probably the most important single variable which must 
be considered in descriptive studies. For most diseases the varia- 
tions in frequency that occur with changes in age are greater than 
those to be found in association with any other variable. Knowl- 
edge of age associations is therefore important for two reasons: 
first, study of this variation may assist understanding of the fac- 
tors responsible for the development of the disease; second, 
associations between age and disease frequency are so strong 
that they produce indirect effects that must be eliminated in examination
of differences in disease rates in association with other
variables. Comparisons of any two populations with respect to
disease frequency are generally meaningless unless account has
been taken of possible age differences between them.

Association of disease frequency with age is usually measured 
by relating the number of cases of the disease in each age group 
to the population in the same age group, and deriving a succes- 
sion of age-specific incidence or prevalence rates. 

Frequently, and particularly in the examination of material 
collected from clinical sources, it is possible to present the dis- 
tribution of a series of patients by age, but information is lacking 
as to the age distribution of the population from which the patients
came. Statements such as “this disease is most common between
the ages of 50 and 60” may be made on the basis of such a
frequency distribution. While they have meaning to the clinician 
or the administrator insofar as they indicate that more of the pa- 
tients seen are in this age group than in any other, such state- 
ments do not of course indicate that the greatest risk of the dis- 
ease lies in this age group. Even diseases for which risk increases 
with age throughout life will show decreasing numbers of cases 
in the highest age groups because the population decreases rap- 
idly with increasing age after about age 55. Thus the data on 
which Figure 8 is based showed the largest numbers of cases in
the age group 55-59 for males and 60-64 for females but the highest
rates are noted in the oldest age groups for both sexes. Although
the necessity of using rates to express risks is frequently
overlooked, the point will not be labored here in view of the discussion
of rates already presented in Chapter 5.

Figure 8. Incidence of rectal cancer according to age and sex in ten metropolitan
areas of the United States, 1947. (Data from Dorn and Cutler, 1959.)
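
The distinction between numbers of cases and rates can be made concrete with a small calculation. The sketch below uses entirely hypothetical populations and case counts (they are not taken from any survey), chosen so that the population thins out at older ages while the risk continues to rise. The function name `age_specific_rates` is ours.

```python
def age_specific_rates(cases, population, per=100_000):
    """Return {age_group: rate per `per` persons} from case counts and populations."""
    return {age: per * cases[age] / population[age] for age in cases}


# Hypothetical figures: fewer people at older ages, but rising risk.
population = {"45-54": 400_000, "55-64": 300_000, "65-74": 150_000, "75+": 50_000}
cases = {"45-54": 80, "55-64": 150, "65-74": 120, "75+": 60}

rates = age_specific_rates(cases, population)
most_cases = max(cases, key=cases.get)      # age group contributing most patients
highest_rate = max(rates, key=rates.get)    # age group at greatest risk

for age in cases:
    print(f"{age:>6}  cases={cases[age]:4d}  rate per 100,000={rates[age]:6.1f}")
print("most cases:", most_cases, "| highest rate:", highest_rate)
```

In this illustration the largest number of patients falls in the 55-64 group, although the rate is highest in the oldest group; a frequency distribution of patients alone would point to the wrong age.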


COHORT AND CURRENT AGE CURVES 


The association of a disease with age may be described in two 
ways: 

(1) An age curve based on the age-specific rates observed in a 
cross section of a population at one point in time. For example, 
Figure 8 represents the age-specific incidence rates of rectal can- 
cer noted in the population of ten selected areas of the United 
States in 1947. The populations on which the rates in the dif- 
ferent age groups are based belong to quite different genera- 
tions; the rate for the age group 30-34 is based on persons born 
between 1913 and 1917, that for the age group 60-64 on persons 
born between 1883 and 1887. This type of curve is known as a 
current or cross-sectional age curve. 

(2) A curve based on age-specific rates for a group of persons 
born in a certain period and observed at successive points in 
calendar time as they grew older. Thus, for persons born in 1890 
the incidence rate at age 30-34 would be based on data for the 
years 1920 to 1924, and, in the same curve, the rate for ages 60-64
would be based on data for that age group from 1950 to 1954. 
This type of age curve is known as a generation or cohort age 
curve. 
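
The arithmetic linking age, year of birth, and calendar year in the two kinds of curve can be written out as follows. This is a minimal sketch that, like the text, ignores the position of birthdays within the year; both function names are ours.

```python
def birth_years_in_cross_section(survey_year, age_low, age_high):
    """Birth years of persons aged age_low..age_high in a given survey year."""
    return survey_year - age_high, survey_year - age_low


def cohort_observation_years(birth_year, age_low, age_high):
    """Calendar years in which persons born in birth_year are aged age_low..age_high."""
    return birth_year + age_low, birth_year + age_high


# Current (cross-sectional) curve: each age group comes from a different generation.
print(birth_years_in_cross_section(1947, 30, 34))  # (1913, 1917)
print(birth_years_in_cross_section(1947, 60, 64))  # (1883, 1887)

# Cohort curve: one generation, observed in different calendar periods.
print(cohort_observation_years(1890, 30, 34))      # (1920, 1924)
print(cohort_observation_years(1890, 60, 64))      # (1950, 1954)
```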

The difference between these two methods of describing age 
associations is of importance only when the frequency of the dis- 
ease under consideration has been changing over a period of time, 
since cohort and current age curves will be identical unless there 
has been secular change in rates. If, however, there has been a 
change in incidence and if the distinction between the two meth- 
ods is not clearly kept in mind, misleading interpretations are 
likely. This was first illustrated by Andvord (1930), Frost (1939) 
and others in studies of age curves for tuberculosis mortality. 

Frost examined death rates from tuberculosis in Massachusetts
for the years 1880 to 1930. The data on which his analysis was
based are shown in Table 8 together with data for 1940 and
1950 added by Pope and Gordon (1955). Current age curves
comparable to those of Figure 8 are derived by reading down the
columns; they are drawn in Figure 9. Frost pointed out that in
current age curves the peak of mortality shifted to a higher age
in successive decades. Cohort age curves may be drawn from the
same data by reading diagonally across the table from top left to
bottom right; they are shown in Figure 10.* All the curves now
have a consistent pattern. They indicate that for any given generation,
after the initially high rates in the 0-4 year age group, the
highest tuberculosis death rates occurred in the 20-29 year age
group.


TABLE 8


Death rates* from all forms of tuberculosis for males in Massachusetts,
1880 to 1950†


                            Year of death
Age
group     1880   1890   1900   1910   1920   1930   1940   1950

0-4       760‡    578    309    209    108     41     10      4
5-9         43     49     31     21     24     11      4      1
10-19      126    115     90     63     49     21      6      1
20-29      444    361    288    207    149     81     36    [?]
30-39      378    368    296    253    164    115     55     25
40-49      364    336    253    253    175    118     86     42
50-59      366    325    267    252    171    127     95     69
60-69      475    346    304    246    172     95    108     87


* Rates are annual death rates per 100,000 male population.
† Data from Frost (1939) and Pope and Gordon (1955).
‡ Rates for the cohort of 1875 are italicized.
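
The two ways of reading Table 8 can be sketched in code. The version below is an illustration under stated assumptions: the rates are entered as we read them from the table (several cells are damaged in the printed copy, and one is left as `None`), the 5-9 row is skipped when reading diagonally because the columns are ten years apart, and all function names are ours. The group aged 0-4 in 1880 corresponds to what the text calls the cohort of 1875.

```python
AGE_GROUPS = ["0-4", "5-9", "10-19", "20-29", "30-39", "40-49", "50-59", "60-69"]
YEARS = [1880, 1890, 1900, 1910, 1920, 1930, 1940, 1950]

# Rows follow AGE_GROUPS, columns follow YEARS (deaths per 100,000 males).
# None marks a cell that is not legible in the printed table.
RATES = [
    [760, 578, 309, 209, 108, 41, 10, 4],
    [43, 49, 31, 21, 24, 11, 4, 1],
    [126, 115, 90, 63, 49, 21, 6, 1],
    [444, 361, 288, 207, 149, 81, 36, None],
    [378, 368, 296, 253, 164, 115, 55, 25],
    [364, 336, 253, 253, 175, 118, 86, 42],
    [366, 325, 267, 252, 171, 127, 95, 69],
    [475, 346, 304, 246, 172, 95, 108, 87],
]


def current_curve(year):
    """Read down one column: every age group in a single calendar year."""
    col = YEARS.index(year)
    return {age: row[col] for age, row in zip(AGE_GROUPS, RATES)}


def cohort_curve(year_at_0_4):
    """Read diagonally: follow the group aged 0-4 in year_at_0_4 as it ages."""
    curve = {}
    col = YEARS.index(year_at_0_4)
    for row_idx, age in enumerate(AGE_GROUPS):
        if age == "5-9":
            continue  # assumption: 5-year row skipped, columns are 10 years apart
        if col >= len(YEARS):
            break     # cohort not yet observed at this age
        curve[age] = RATES[row_idx][col]
        col += 1
    return curve


def peak_after_infancy(curve):
    """Age group with the highest rate, ignoring 0-4 and illegible cells."""
    later = {a: r for a, r in curve.items() if a != "0-4" and r is not None}
    return max(later, key=later.get)


for year in (1910, 1920, 1930, 1940):
    print("current", year, peak_after_infancy(current_curve(year)))
for start in (1880, 1890, 1900, 1910):
    print("cohort aged 0-4 in", start, peak_after_infancy(cohort_curve(start)))
```

With these entries the peak of the current curves moves from 30-39 in 1910 to 60-69 in 1940, while each of the four cohorts followed shows its peak at 20-29, which is the contrast described in the text.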

Two interpretations of these findings are possible. The first is
that the marked fall in the mortality from tuberculosis was in fact
accompanied by a change in the association of the disease with
age, that is, that the factors responsible for the decline in mortality
acted differentially at different ages. The second is that there was
no change in the age association of the disease during the period
of decline (that the decline affected all age groups equally)
but that the mortality level for any age group was characteristic
of the cohort to which the age group belonged rather than of the
period during which the mortality was measured. (Note that age
association here means an association of disease mortality with
age which is independent of mortality changes which have occurred
over time.) The latter interpretation was accepted by
Frost and others who considered the question. It involves the
supposition that the mortality level of a cohort is determined
very early in its existence. This supposition is consistent with the
hypothesis that the mortality from tuberculosis in adult life is
predominantly endogenous (resulting from the activation of latent
foci) and determined for any cohort largely by the degree
of infection received in early life.

* Note that the exact data for cohorts are not contained in the table. Thus, for the
age group 0-4 for the cohort of 1875 data for 1877 to 1878 would be desirable, since
this cohort occupied the center of the age group in these years. The data for 1880
are used as an approximation.

Figure 9. Tuberculosis mortality according to age (current age curves), Massachusetts
males, 1880 to 1950. (Data from Table 8.)

Figure 10. Data from Table 8 arranged to show the mortality experience of
Massachusetts male cohorts born between 1855 and 1935 (cohort age curves).


This is an example of a situation in which current age curves 
were judged to have been distorted by rate changes which had 
taken place during the past. Current age curves appeared dis- 
torted because the age association seemed best described by a 
cohort curve. However, cohort curves may be similarly distorted 
if current curves are more appropriate for the description of a 
particular situation. 

In some situations, a decision may be made on a priori grounds 
as to the type of analysis — cohort or current — which will most 
appropriately describe the age association of a particular disease. 
Thus, an epidemic of certain acute infectious diseases will affect
all or most cohorts living at the time of the epidemic regardless
of their dates of birth. For example, in a study of the mortality 
rates of successive cohorts in England and Wales, Case (1956) 
noted the effect of the 1918-1919 influenza epidemic as an up- 
ward kink in the age curve at ages 15-19 for the cohort born in 
1901, at 25-29 for the cohort born in 1891, and at 35-39 for the 
cohort born in 1881. The fact that different cohorts were affected 
at different ages but that all were affected during the same calen- 
dar time suggested, as expected, that the change was characteris- 
tic of a time and not of a cohort. Note that here any single cohort 
age curve is misleading, since it suggests an “age effect” at the age 
at which that cohort was affected by the epidemic. Thus, the kink 
in the age curve at ages 15-19 for the cohort of 1901 had in real- 
ity nothing to do with age but was merely a circumstance of the 
cohort being of that age at the time of the epidemic. Misinter- 
pretation of current age curves will, in theory, not be possible un- 
der such a circumstance, since risks for all age groups will be 
based on years of equal epidemicity. 

Suppose, however, that an epidemic of a disease occurred in 
early life and that it was associated with chronic or permanent 
residua, as, for example, maternal rubella followed by congenital 
cataract. In such a situation high prevalence rates for the anomaly 
would be found at all ages of the affected cohort, but (if the 
epidemic were not repeated) there would be no period at which 
more than one age group was affected. In this case, the cohort 
age trend would best describe the age association, since for any 
cohort all values will be based on populations experiencing 
equal degrees of exposure, while current curves will indicate (in- 
correctly) an age effect in the age group occupied by the affected 
cohort at the calendar time referred to by the particular current 
age curve. 
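
The reasoning in the two preceding examples can be reduced to a simple test: an excess that strikes different cohorts in the same calendar year is characteristic of a time, while an excess that follows a single cohort through several ages is characteristic of that cohort. The sketch below is only an illustration of that logic. The function name `classify_excess` is ours, midpoints of the age groups stand in for exact ages, and the birth year used for the rubella-type example is hypothetical.

```python
def classify_excess(observations):
    """Classify excess rates given (birth_year, age_at_excess) pairs.

    Returns "period" when every excess falls in the same calendar year,
    "cohort" when every excess belongs to one birth year, else "unclear".
    """
    birth_years = {birth for birth, _ in observations}
    calendar_years = {birth + age for birth, age in observations}
    if len(calendar_years) == 1 and len(birth_years) > 1:
        return "period"
    if len(birth_years) == 1 and len(calendar_years) > 1:
        return "cohort"
    return "unclear"


# Influenza of 1918-1919: kinks at ages 15-19, 25-29 and 35-39 (midpoints used).
influenza = [(1901, 17), (1891, 27), (1881, 37)]

# Hypothetical residuum of an epidemic in early life: one cohort, several ages.
rubella_type = [(1941, 5), (1941, 15), (1941, 25)]

print(classify_excess(influenza))     # period
print(classify_excess(rubella_type))  # cohort
```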

In the examples of influenza and rubella the propriety of ap- 
plying the one or the other type of analysis was apparent. Com- 
monly, however, the issue is not clear-cut. For example, in tuber- 
culosis the hypothesis that attack rates experienced by a cohort 
early in life determine its level of tuberculosis mortality through- 
out its existence seems neither more nor less reasonable than the 
assumption that tuberculosis mortality depends on environmental
conditions existent closer to the time at which the mortality
is being measured. Under the first hypothesis, cohort curves
will reflect the age association more accurately, but under the 
second, current curves would be indicated. While an a priori de- 
cision between the two methods cannot be made in this case, the 
remarkable consistency of the age curves derived by the cohort 
method suggests that the changes are in fact occurring in a pat- 
tern consistent with the first hypothesis. To be consistent with 
the second, a subsidiary hypothesis needs to be introduced to the 
effect that age groups have been unequally affected by changing 
environmental conditions. As noted by Springett (1952), evi- 
dence as to the applicability of one or the other type of analysis 
will depend on both (1) a priori reasoning, based on knowledge 
of the pathogenesis and natural history of the disease in question,
and (2) demonstration of a more consistent age pattern in one 
or the other type of analysis. 

It is sometimes assumed, incorrectly, that one or the other type 
of curve (usually the cohort curve) is superior for all situations. 
For example, Korteweg (1951) and others have noted that age 
incidence and mortality curves for lung cancer as derived from 
current data show a peak at ages 60-69 and thereafter decrease 
with age, while cohort age curves do not show this fall, the risk 
increasing throughout the life span. Illustrative data are shown 
in Figure 11. In commenting on these findings, several authors 
have suggested that the cohort curves portray the real change in 
risk with age, and that the pattern revealed by the current 
curve results from the striking increase in incidence of lung can- 
cer which has occurred during this century. However, this as- 
sumption may not be justified. For example, a person age 70 born 
in the cohort of 1880 achieved that age in 1950; a person age 60 
of the same cohort achieved that age in 1940. Either an increase 
between 1940 and 1950 in the risk of acquiring lung cancer or an 
increase in the chances of the disease being diagnosed could ac- 
count for an increase in the rates between age 60 and 70. Such 
an increase would be a function not of age but of time, and 
the cohort age curve would therefore be misleading if interpreted 
solely as a function of age. On a priori grounds, the hypothesis 
that lung cancer rates are in part characteristic of a cohort is certainly
a valid one in view of the large body of experimental evidence
on the long latent period known to be associated with carcinogenesis,
but there are also features of cancer morbidity, such
as completeness of diagnosis, which might be expected to be
characteristic of a certain period rather than of specific cohorts.
Equally valid grounds for the use of either current or cohort
curves can therefore be advanced. In addition, neither type of
curve can be said to reveal more “consistent” results than the
other. The age curve for lung cancer shows no convenient peaks,
such as seen in the tuberculosis cohort curves, which would assist
in the decision. Neither can the method of approaching the problem
discussed in connection with changes in disease rates that occur
over time (see Chapter 11) be applied to the data on lung
cancer, for reasons discussed in that chapter (page 176). There
is therefore, in this particular instance, no way of deciding which
of the two types of curves illustrated in Figure 11 is the more accurate
indicator of the age association of lung cancer.

Figure 11. Mortality rates for cancer of the lung and bronchus, white males, United States, 1914
to 1950. (Reproduced, with permission, from Dorn and Cutler, 1959.) (Solid lines are cohort curves,
broken lines are current age curves.)


Implications from Cohort Analysis 


If a decision can be made, whether on a priori grounds, by the 
demonstration of a more consistent age pattern, or by the method 
discussed in Chapter 11, that incidence changes are apparently 
occurring in a cohort pattern, certain implications follow. 

In the first place, apparently paradoxical trends such as those 
noted for tuberculosis in Figure 9 are explained. In diseases, such 
as the neoplastic diseases, in which incidence increases through- 
out the life span the distortion of the age trend may not be so 
obvious as in diseases, such as tuberculosis, whose peak incidence 
occurs during early adult life. In such cases the distortion may 
amount merely to a steepening or diminishing of the slope of the 
age trend. The theoretical situation is illustrated in Figure 12. If 
incidence over time is increasing, and the increase is occurring 
in such a way as to affect successive cohorts, then a current age 
study tends to reduce the slope of the curve showing the actual 
association with age. If incidence is decreasing in a similar fash- 
ion, then a current age study exaggerates the slope of the curve of 
age association. Conversely, when the age association is charac-
teristic of a period of time rather than of cohorts of the popula-
tion, then a cohort study exaggerates the association if incidence
is increasing and minimizes it if incidence is decreasing. An ex-
ample of one of these situations is evident in Figure 11. Cohort
age curves for lung cancer show a steeper slope than do current
curves because of the increases in incidence which have occurred
over time.


Figure 12. Schematic comparison of cohort and current age curves under con-
ditions of increasing and decreasing incidence.
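The distortion described in this paragraph can be illustrated numerically. What follows is a minimal sketch under stated assumptions, not a reconstruction of Figure 12 or of any data in this chapter: incidence is assumed to be the product of a fixed age effect and a level specific to each birth cohort. The functions `age_effect` and `cohort_level`, the reference year 1880, the calendar year 1950 and the trend of 2 per cent per year are all hypothetical choices made only for illustration.

```python
import math

AGES = [40, 50, 60, 70]

def age_effect(age):
    """Assumed true age association within any one cohort (arbitrary units)."""
    return 0.5 * age

def cohort_level(birth_year, trend):
    """Assumed multiplier for a birth cohort.

    trend > 0 means later-born cohorts have higher incidence,
    trend < 0 means later-born cohorts have lower incidence.
    """
    return math.exp(trend * (birth_year - 1880))

def rate(age, calendar_year, trend):
    """Incidence at a given age in a given calendar year."""
    return age_effect(age) * cohort_level(calendar_year - age, trend)

def current_age_curve(calendar_year, trend):
    """Rates by age in a single calendar year (each age is a different cohort)."""
    return [rate(a, calendar_year, trend) for a in AGES]

def cohort_age_curve(birth_year, trend):
    """Rates by age for one birth cohort followed through life."""
    return [rate(a, birth_year + a, trend) for a in AGES]

def steepness(curve):
    """Rate at the oldest age divided by the rate at the youngest age."""
    return curve[-1] / curve[0]

cohort = steepness(cohort_age_curve(1880, trend=0.02))
current_rising = steepness(current_age_curve(1950, trend=0.02))
current_falling = steepness(current_age_curve(1950, trend=-0.02))

print(f"cohort curve:                     {cohort:.2f}")
print(f"current curve, incidence rising:  {current_rising:.2f}")
print(f"current curve, incidence falling: {current_falling:.2f}")
```

Under these assumptions the current age curve is flatter than the cohort curve when incidence is rising from cohort to cohort, and steeper when it is falling, which is the relation stated in the text.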

The fact that age curves for a disease follow a cohort pattern 
influences hypotheses concerning the factors responsible for 
changes in disease incidence over time. Thus, as noted by Frost 
with respect to tuberculosis, it implies that the factors are opera- 
tive early in the existence of the cohort. In this case the supposi- 
tion is that the decreasing infection rates in the first 5 years of 
life resulted in decreased rates throughout the remainder of life. 
It is pointed out in Chapter 11 that time changes in several neo- 
plastic diseases have also followed a cohort pattern. In these 
latter diseases the evidence suggests that the environmental 
situations or personal habits which have been responsible for 
changes in the incidence of these neoplasms become established 
prior to age 35, the age at which the first differences between
cohorts are evident. It should be noted that this implication ap-
plies only to the changes in disease incidence which have occurred
over time. Thus, conformity to a cohort pattern does not imply 
that all the factors responsible for later tuberculosis mortality 
are operative in the early years. There undoubtedly are factors 
operative later in life also. The implication is, however, that the 
latter factors have not changed to any considerable extent over 
the period examined, and that the change in incidence at later 
ages has resulted primarily from changes in factors operative 
early in life. 


INTERPRETATION OF AGE ASSOCIATIONS 


Innumerable factors enter into the determination of the shape 
of the age association curves of a given disease. Only a few of 
these are understood. 

At the most superficial level, there are a number of circum- 
stances which in fact have little to do with age per se, but which 
may nevertheless affect the shape of an age incidence curve. One 
of these — the use of current age curves when cohort curves are 
appropriate, and vice versa — has already been discussed. Vari- 
ations in accuracy of diagnosis according to age must also be con- 
sidered in this category, particularly in regard to the oldest age 
groups. There is, in many clinical situations, less concern over
establishing an exact cause of death for an 80-year-old than for a 
40-year-old person. This situation tends to reduce age-specific rates 
for the oldest age groups for diseases of difficult diagnosis, and to 
exaggerate rates in the same age groups for “wastebasket” diag- 
noses. In addition, basic population data may be inaccurate for the 
oldest age groups, leading to further distortions. The latter sit- 
uation applies in the United States particularly to the non-white 
population. 

If the rate of progression of a disease varies with age, age-risk 
curves may vary according to whether they express rates of on- 
set, diagnosis or mortality. For example, in a disease in which the 
average interval between onset and death increases with age, a 
mortality age curve will in general be less steep than an incidence 
curve. Even when case fatality and average survival time are the 
same for all ages, the mortality curve (shifted, of course, to-
wards the right of the age scale) may be different in shape from
the incidence curve for the same disease. Thus, if a final inclusive 
age group is used (e.g., 75 and over), mortality rates will be 
higher than incidence rates in that age group while the reverse 
will be true in the earlier age groups. An illustration of the dif- 
ferences between age curves based on incidence, prevalence and 
mortality data is shown for a chronic disease, multiple sclerosis,
in Figure 13.


Figure 13. Rates of incidence, prevalence and mortality per 100,000 of multi-
ple sclerosis in males, Boston, 1939-1948. (Data from Ipsen, 1950.)


If the disease under examination is of such a very chronic na- 
ture that a considerable proportion of those affected die from 
other causes before death occurs from this disease, then the mor- 
tality age curve of the disease will be influenced by the current 
age-specific risks of dying from all causes, provided, of course,
that all cases in which the disease is present at the time of death
are used in the calculation of the curve. This is merely a restate- 
ment that mortality data are subject to misinterpretation if the 
disease is not one in which death occurs in a high proportion of 
the patients, and within a reasonably short time of onset. 

Bimodality — that is, the occurrence of two separate peaks — 
in an age incidence curve of a disease is always of interest. It in- 
dicates, first of all, that the material is not homogeneous — that 
the entity under examination might properly be divided into two. 
Even when an experiential, rather than a manifestational, entity 
is being examined, bimodality suggests the existence of experien- 
tial differences other than that on which classification of the dis- 
ease is based. For example, the age curves for tuberculosis in 
Figure 10 (page 91) are bimodal, showing one mode in the 0-4 
age group and a second in the 20-29 age group. Since tubercu- 
losis is defined in terms of experience with the tubercle bacillus, 
these two modes are both affecting the same “disease entity.” 
This epidemiologic finding suggests that two distinct sets of an- 
cillary experiences must be taken into account to explain the en- 
tity’s observed age distribution. 

Age incidence curves for the manifestational entity Hodgkin’s 
disease are also distinctly bimodal, showing one peak in early 
adult life and a second in old age (Figure 14). Probably, there- 
fore, parts of two distinct experiential entities are congruent with 
this manifestational entity as presently constituted. This hypothe- 
sis has been supported by the demonstration that the disease in 
persons under 40 years of age differs from that in persons over 
that age with respect to sex ratio, racial, religious, and geo- 
graphic distribution, and duration of illness (MacMahon, 1957). 
Apart from suggesting the presence of separate experiential en-
tities, such evidence can be of service in the finer definition of
manifestational entities if clinical and pathologic findings are 
re-examined in the light of the epidemiologic differentiation. 

Figure 14. Incidence of Hodgkin’s disease in the white population of Brook-
lyn, 1943 to 1952. (From MacMahon, 1957.)


A common type of age curve is that exhibited by the so-called
degenerative diseases in which incidence rates increase progres-
sively throughout the life span. Understanding of the mechanisms
responsible for this type of age curve is a problem which is
only in part epidemiologic. At present it seems likely that funda-
mental studies of an experimental type have most to offer in the
understanding of those bodily changes which occur with age and
which lead to increasing rates for so many different diseases.
Probably the explanations of such associations are as numerous 
as the diseases which exhibit them. Certainly change in no single 
factor is likely to be responsible for incidence changes in so many 
fundamentally different disorders. 

In considering the age curves for neoplastic disease, Huxley 
(1951) has suggested that they result from a genetic selection 
which reduces the number of persons susceptible to tumors genet- 
ically destined to occur prior to the end of the reproductive years. 
This is clearly not a complete explanation, since those people or 
tumors surviving such an evolutionary process must possess some
characteristic which destines onset of the tumor after the repro- 
ductive age. We are still faced with the elucidation of this 
characteristic. There is little evidence to support Huxley’s hypoth-
esis; the original premise on which it was based (that tumors
which occur during the reproductive years are less common than
those occurring later) has been shown to be untrue (Landauer
and Beall, 1952). 

It is not known whether there are truly endogenous changes 
with age which affect susceptibility to disease — that is, changes 
which are the inevitable consequences of the passage of time and 
which are completely independent of environmental circum- 
stances to which the individual is exposed during that time. Again 
with reference to neoplastic disease, Hueper (1952) has supported 
the view that the shape of the age curve is determined largely by 
the length of the latent period between encounter with carcino- 
genic agents and manifest neoplasia, by the age at which such ex- 
posure first takes place, and by the intensity of exposure. The 
long latent periods found in human neoplastic disorders follow- 
ing exposure to known carcinogenic agents (up to 50 years) 
make it inevitable that such neoplasms should occur in the older 
ages, even if exposure was first effective at birth. Hueper points 
out that the age distribution of scrotal cancer and other occupa- 
tional neoplasms varies according to the age at first exposure to 
the carcinogenic agent. In addition, there is no experimental 
evidence that the latent period between initiation of exposure 
and development of neoplasia is any longer for young than for 
old experimental animals. There is therefore a good deal of sup- 
port for Hueper’s hypothesis. 

However, this view is not so different as it might appear to be 
at first sight from the hypothesis of changing tissue susceptibility 
with age. “Aging,” as a characteristic of any individual, is merely 
the passage of time; this passage of time must necessarily be ac- 
companied by exposure to environmental situations which alter 
the body chemistry and physiology. Among the most obvious 
of these situations are those necessary for the continuation of 
life such as the intake of food and oxygen. Some of these altera- 
tions may result in increased incidence of certain diseases. If the 
changes result from exposure to a known toxic agent, and if such 
exposure is avoidable and therefore not a necessary part of the 
aging process, such as, for example, in the case of occupational 
carcinogens, we would not regard this as being a primary associa-
tion with age, even though aging is essential to the accumula-
tion of a sufficient dose of the toxin. If, however, the disease-in-
ducing changes were produced by environmental situations which
were essential concomitants of the aging process, such as, for ex-
ample, prolonged consumption of oxygen, should this be re-
garded as a primary effect of age or not? Certainly, aging would
inevitably be accompanied by such changes. Between these ex- 
tremes are a wide variety of situations in which environmental 
circumstances may be separated to a greater or lesser degree from 
aging. For example, it may be that some component of the in- 
cidence of human leukemia can be attributed to exposure to 
background radiation from natural sources. At present aging 
is necessarily accompanied by cumulative exposure to background 
radiation, and the relationship between age and leukemia inci- 
dence may, under this hypothesis, be regarded as a fairly direct 
one. However, if methods of protection were devised such that 
only artificial sources of radiation contributed to the total radia- 
tion experience, leukemia incidence as influenced by background 
radiation might be regarded as independent of age. 

Whether or not there is a truly endogenous aging process (a
“biological clock”) resulting in increased disease susceptibility,
the evidence assembled by Hueper indicates that there are cer- 
tainly many alterable situations, separable from age, which con- 
tribute to the shape of age incidence curves. The existence of a 
progressive association with age cannot ipso facto be regarded as 
evidence of endogenous causation and thus as grounds for pessi- 
mism with regard to preventive possibilities. Practically, effort 
should be concentrated on the elucidation of those deleterious en- 
vironmental exposures which accompany but are nevertheless sep- 
arable from age. When these have been eliminated, greater con- 
cern may be felt over the possibility of an endogenous component 
in the aging process.


Characteristics of Persons: Sex and Ethnic Group


This chapter continues the discussion of variables that are 
descriptive of the types of person affected by disease. It is con- 
cerned with two variables, sex and ethnic group, which in con- 
junction with age are commonly thought of as the basis of descrip- 
tive epidemiology. Their examination here is intended to illustrate 
the types of epidemiologic analysis and interpretation that can be 
applied to descriptive variables in general. 


SEX 


Measurement of Association 


As with descriptive variables other than sex, the association be- 
tween a particular disease and the variable is most satisfactorily 
demonstrated by a difference in disease rates between subgroups 
of the variable (in the case of sex, between disease rates for
males and for females). In an easier (and commonly used) pro-
cedure, deviations in the sex ratio* from 50 per cent are noted,
under the assumption that the sex ratio in the population from 
which the cases were drawn is also 50 per cent. It is a fact of life, 
however its justice may be argued, that this assumption may in
specific instances not be entirely correct. For example, among
the 1955 Massachusetts population 70 or more years of age the
sex ratio was only 44 per cent. Calculation and comparison of
sex-specific rates are therefore preferable in that one fewer assump-
tion needs to be considered in the analysis of findings.


* Number of affected males as a percentage of the total affected.
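The point about the sex ratio can be made concrete with a small worked example. The sketch below is illustrative only: the population counts and case counts are hypothetical, chosen so that 44 per cent of the population is male, as in the Massachusetts example; the helper names `sex_ratio` and `rate_per_1000` are likewise assumptions for this illustration.

```python
def sex_ratio(males, females):
    """Males as a percentage of the total (the definition used in the footnote)."""
    return 100.0 * males / (males + females)

def rate_per_1000(cases, population):
    """Cases per 1000 persons in the population from which they arose."""
    return 1000.0 * cases / population

# Hypothetical elderly population in which 44 per cent are male.
male_population, female_population = 44_000, 56_000

# Hypothetical case counts (not taken from any survey).
male_cases, female_cases = 440, 560

case_sex_ratio = sex_ratio(male_cases, female_cases)
population_sex_ratio = sex_ratio(male_population, female_population)
male_rate = rate_per_1000(male_cases, male_population)
female_rate = rate_per_1000(female_cases, female_population)

print(f"sex ratio of cases:      {case_sex_ratio:.0f} per cent")
print(f"sex ratio of population: {population_sex_ratio:.0f} per cent")
print(f"male rate:   {male_rate:.1f} per 1000")
print(f"female rate: {female_rate:.1f} per 1000")
```

Judged against an assumed 50 per cent, the sex ratio of the cases would suggest a female excess; the sex-specific rates show that in this hypothetical population the two sexes are affected equally.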

In most populations the two sexes also have different age dis- 
tributions, females commonly being on the average older than 
males. Small differences in sex-specific rates must therefore be 
viewed with caution unless the possible influence of age differ- 
ences has been taken into account. 


Sex Distribution of Disease 


More males are born alive than females in a ratio of about 106:100.
Death rates have consistently been higher for males than for
females, except at child-bearing ages in situations where stand- 
ards of obstetric care have been poor, when rates for females have 
been observed to exceed those for males. The disparity between 
male and female death rates in the United States at various ages 
is shown for 1955 in Table 9. Of interest is the fact that the 
death rates for United States males are currently highest relative 
to those for females during the child-bearing ages of females. 


TABLE 9


Sex-specific mortality rates* in broad age groups in the white popula-
tion of the United States, 1955


                         Mortality rates             Male rate as
Age group                                            percentage of
(years)            Total      Male       Female      female rate

Under 1            26.2       29.8       22.4        133
1-4                 1.0        1.1        0.9        122
5-14                0.5        0.6        0.4        150
15-39               1.4        1.8        0.9        200
40-64               9.1       12.0        6.2        194
65 and over        61.5    [illegible]   52.8        136
All ages†           8.9       10.9        7.2        151


* Rates are annual death rates per 1000 population.

† For “all ages” and within each broad age group the rates are standardized by
the direct method to the age distribution in 5-year age groups of the total United
States population in 1955.
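The direct method of standardization mentioned in the footnote can be sketched briefly. This is a minimal illustration with hypothetical figures, not the calculation behind Table 9: the three age groups, the age-specific rates, the two population structures and the standard population are all invented, and `weighted_rate` is a name chosen for this example. Both groups are given identical age-specific rates so that any difference in their crude rates is due to age structure alone.

```python
def weighted_rate(age_specific_rates, population):
    """Average of age-specific rates, weighted by the numbers in each age group.

    With a group's own population this gives its crude rate; with a
    standard population it gives the directly standardized rate.
    """
    total = sum(population)
    return sum(r * n for r, n in zip(age_specific_rates, population)) / total

# Hypothetical death rates per 1000 in young, middle and old age groups.
rates_a = [1.0, 10.0, 60.0]
rates_b = [1.0, 10.0, 60.0]   # identical to group A by construction

# Hypothetical age structures: group B contains more old persons.
population_a = [500, 400, 100]
population_b = [400, 400, 200]

# Hypothetical standard population applied to both groups.
standard_population = [450, 400, 150]

crude_a = weighted_rate(rates_a, population_a)
crude_b = weighted_rate(rates_b, population_b)
standardized_a = weighted_rate(rates_a, standard_population)
standardized_b = weighted_rate(rates_b, standard_population)

print(f"crude rates:        A {crude_a:.2f}, B {crude_b:.2f} per 1000")
print(f"standardized rates: A {standardized_a:.2f}, B {standardized_b:.2f} per 1000")
```

In this invented example the crude rates differ although the age-specific rates are the same; the standardized rates agree. This is the reason small differences in sex-specific rates call for caution when the two sexes differ in age distribution.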


In the United States the general disparity has been increasing.
Illustrative data for 1933* and 1955 are presented in Table 10.
Comparison of age-standardized mortality rates in the two years
indicates that the increased difference results because the rates
for females fell more rapidly than those for males.


* This year is chosen for comparison because it is the first year in which data for
the whole United States are available.


TABLE 10


Mortality rates* for males and females of the white population of the
United States, 1933 and 1955


               Mortality rates          Male rate as
                                        percentage of
Year           Male       Female        female rate

1933           14.0       11.5          122
1955           10.9        7.2          151


* Rates are annual death rates per 1000 population, and are age-standardized as
in Table 9.


The less fortunate male position is not the result of serious dis-
advantage in only a few of the assigned causes of death (Table
11). Although considerable variation in sex pattern is evident be-
tween assigned causes, death rates among males exceeded those
among females for the majority of the causes, the male rate commonly
being 150 per cent or more of the female rate, and in one instance
(respiratory cancer) 585 per cent. Among the commonest causes of
death, excluding deaths assigned specifically to diseases of the sex
organs, only diabetes mellitus and hypertensive heart disease
showed higher death rates among females than among males.


TABLE 11


Mortality rates* from selected common causes for males and females
in the United States, 1955


                                      Mortality rates              Male rate as
                                                                   percentage of
Assigned cause                  Total      Male       Female       female rate

Respiratory cancer               18.0      31.6         5.4        585
Peptic ulcer                      5.9       9.6         2.5        384
Suicide                          10.1      16.2         4.5        360
Homicide                          4.5       7.0         2.1        333
Motor vehicle accidents          23.3      35.9        11.4        315
Syphilis                          2.3       3.5         1.2        292
Tuberculosis                      9.1      13.3         5.2        256
Urinary cancer                    7.0       9.8         4.6        213
Cirrhosis of liver               10.2      13.8         6.7        206
Accidents, other than
  motor vehicle                  34.5      46.1        25.2        199
Arteriosclerotic heart
  disease                       251.6     336.0       174.4        193
Appendicitis                      1.4       1.8         1.0        180
Digestive cancer                 52.4      61.2        44.6        157
Vascular lesions of the
  central nervous system        109.8   [illegible]   105.3        109
Hypertensive heart disease       46.1      44.5    [illegible]      94
Diabetes mellitus                15.5      13.1    [illegible]      74
Genital cancer                   23.0      19.7        26.8         74


* Rates are annual mortality rates per 100,000 population. They are age-stand-
ardized as in Table 9.


Seemingly paradoxically, surveys of morbidity (rather than death)
have consistently found illness rates, defined in a variety of ways,
to be higher among females than among males. Data from four such
surveys are shown in Table 12. In spite of gross variation in
definitions of illness and incapacity, and the different measures of
frequency used, rates for females are consistently and substantially
higher than those for males. This consistency indicates either that
females admit to illness more readily than males or that they
experience a higher incidence of illnesses which, in view of the
preceding mortality data, must presumably be less lethal than those
experienced by males. Data from the U. S. National Health Survey
(1959) show that this sex pattern for morbidity is evident in all
gradations of socio-economic status, in urban and rural areas, and
when the examination is restricted to gainfully employed persons.
When separate causes of illness are considered, women have higher
morbidity rates than men for the majority of chronic diseases. Many
chronic conditions are strikingly more common in women, for example,
thyrotoxicosis, diabetes mellitus, cholecystitis and biliary calculi,
obesity, arthritis, psychoneurosis and many benign tumors. Only a few
illnesses appear to be predominantly masculine diseases when examined
in morbidity data; these include peptic ulcer, inguinal hernia,
accidents, arteriosclerotic heart disease and respiratory cancer.


TABLE 12


Morbidity rates for males and females in four surveys


                                                                       Rates per 1000
Survey                  Measure of illness or incapacity             Males        Females

U.S. “9000 families”    Total illnesses (annual prevalence)           720          915
survey, 1928-1931       Nondisabling illnesses (annual prevalence)    285          374
(Collins, 1940)         Disabling illnesses (annual prevalence)       435          541
                        Bed illnesses (annual prevalence)             345          476
                        Days sick per year                        [illegible]  [illegible]
                        Days in bed per year                         3046         4699

Baltimore “Eastern      Acute illness (annual prevalence)            1044         1474
Health District”        Chronic illness (annual prevalence)           191          285
survey, 1938-1943       Doctor’s calls (annual incidence)            2123         2706
(Downes, 1950)          Hospital admissions (annual incidence)         67           83

British “Survey of      Total illnesses (monthly prevalence)          786         1094
Sickness,” 1946-1947    Total sick persons (monthly prevalence)       487          588
(Stocks, 1949)          New major illnesses (monthly incidence)        46      [illegible]
                        New minor illnesses (monthly incidence)       259          328
                        Days incapacity in month                     1079          979
                        Medical consultations in month                599          420

U.S. National Health    Acute illness (annual incidence)             2716         2720
Survey, 1957-1958       Restricted-activity days (annual)           17700        22200
(1959)                  Bed-disability days (annual)                 6900         8700
                        Hospital discharges (annual incidence)         74           83


ETHNIC GROUP 


A variety of terms might be selected to distinguish groups of 
persons who have a greater degree of homogeneity than the popu- 
lation at large, whether from some degree of homogeneity of 
present-day customs or from a common biologic inheritance. 
While the use of the word “race” extends from a “permanent 
variety of the genus Homo” on the one hand to groups with as 
little common background as “the race of doctors” (Webster) on
the other, “race” is so often equated with the former rather than
the latter usage that we prefer “ethnic group” as conveying more 
clearly the intended epidemiologic concept. 

Actually, the epidemiologist is interested in ascertaining dis- 
ease rates among any groups whose members have lived together 
and/or intermarried for a sufficient length of time to have ac-
quired common characteristics whether by biologic or by social
mechanisms. For practical reasons such groups must be identified
by commonly applied descriptive terms. These include race (in
its restricted sense), nativity, religion and local reproductive and 
social units, such as tribes and geographically or socially isolated 
groups. 

In the discussion of sex differences, mention was made of the 
uncertainty of inferences derived from sex ratios rather than from 
sex-specific rates. A similar situation arises frequently in the 
comparison of ethnic groups when either of two methods is used: 

(1) The frequency of persons of a given ethnic group with a 
specified disease is expressed as a percentage of the number of the 
same ethnic group with all diseases or a certain group of diseases, 
and these frequencies are compared between ethnic groups. 

(2) The frequency of persons of a certain ethnic group with 
a certain disease is expressed as a percentage of the total number 
of persons with the same disease and compared with similar 
percentages among patients with other diseases or among all pa- 
tients seen at a given hospital. 


TABLE 13


Mortality rates* from cancer in Bantu mine workers and
American Negroes†


Site                  Bantu           American Negro

Liver cancer‡         [illegible]      3.0
Other cancer          [illegible]     61.5
All cancer            14.0            64.5


* Rates are annual mortality rates per 100,000 population, and are not age-stand-
ardized.

† From Gilliam, 1954.

‡ This includes cancer of the biliary passages.


Both these methods are likely to result in incorrect inferences, 
since the population at risk is ignored. For example, Berman 
(1940) noted that 91 per cent of cancers among Bantu laborers 
in Witwatersrand gold mines were primary cancers of the liver. 
Since the liver contributes only about 1 per cent of cancers among
European whites, it was inferred that the Bantus suffered an ex- 
traordinarily high rate of primary liver cancer. However, by the 
use of race-specific rates, Gilliam (1954) has shown that the ex-
traordinary feature of the Bantu experience is not the high fre-
quency of liver cancer but the striking rarity of all other forms
of cancer. While the Bantus studied do have somewhat higher 
rates of liver cancer than American Negroes (and European 
whites), the excess is not nearly as great as that suggested by the 
comparison of relative frequencies (Table 13). 
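The difference between comparing relative frequencies and comparing rates can be shown with a short calculation. The sketch below uses hypothetical rates for two populations labeled A and B; they are not the figures of Berman or of Gilliam, and the function `percentage_of_all_cancer` is a name chosen for this illustration. Population A is given a moderately higher liver cancer rate and a very low rate of all other cancers.

```python
def percentage_of_all_cancer(site_rate, other_rate):
    """Cancers of one site as a percentage of all cancers (relative frequency)."""
    return 100.0 * site_rate / (site_rate + other_rate)

# Hypothetical annual rates per 100,000 population.
liver_a, other_a = 12.0, 2.0    # population A: few cancers of other sites
liver_b, other_b = 3.0, 60.0    # population B: many cancers of other sites

relative_frequency_a = percentage_of_all_cancer(liver_a, other_a)
relative_frequency_b = percentage_of_all_cancer(liver_b, other_b)

ratio_of_relative_frequencies = relative_frequency_a / relative_frequency_b
ratio_of_rates = liver_a / liver_b

print(f"liver cancer as percentage of all cancer: A {relative_frequency_a:.1f}, "
      f"B {relative_frequency_b:.1f}")
print(f"ratio of relative frequencies (A to B): {ratio_of_relative_frequencies:.1f}")
print(f"ratio of liver cancer rates (A to B):   {ratio_of_rates:.1f}")
```

In this invented example the relative frequencies greatly overstate the excess of liver cancer in population A, because the denominator of the percentage (all cancers) is itself very different in the two populations. Only rates based on the population at risk show the size of the true excess.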


Race 


A widely used concept of the word “race” is in its application 
to the major subdivisions of man as they exist at the present time 
— White, Negroid and Mongoloid. Much epidemiologic work has 
been done on the comparison of disease rates in “racial” groups 
defined in this way. In particular, in the United States, differences 
in disease rates between the white and Negro populations have 
been extensively studied and have been productive of hypothe- 
ses. Mortality statistics contain a great deal of information on 
this point, some of which is shown in Tables 14, 15 and 16. As in 
the case of the information shown in these tables, most of the 
published mortality data are tabulated to show only rates for 
“whites” and “non-whites,” without further subdivision of the
non-white population. However, since approximately 95 per 
cent of the non-white population of the United States is Negro, 
rates for “non-whites” can be considered to reflect the rates for
Negroes with some accuracy. 

In Table 14 it is seen that the total age-standardized mortality
rate for non-whites was some 35 per cent higher than that for
whites in 1955. Table 15 indicates that mortality rates have fallen
considerably between 1933 and 1955 in both groups, the fall be-
ing somewhat greater for non-whites than for whites.


TABLE 14


Mortality rates* for whites and non-whites in broad age groups in the
United States, 1955


                         Mortality rates               Rate in whites as
Age group                                              percentage of
(years)            Total        White      Non-white   rate in non-whites

Under 1            29.6         26.2       51.4          51
1-4             [illegible]      1.0        2.0          50
5-14                0.5          0.5        0.6          83
15-39               1.6          1.4        3.2          44
40-64               9.8          9.1       17.2          53
65 and over        61.5         61.5       61.6         100
All ages            9.3          8.9       12.5          71


* Rates are annual mortality rates per 1000 population and are standardized as
in Table 9.


TABLE 15


Mortality rates* for whites and non-whites in the United States,
1933 and 1955


               Mortality rates            Rate in whites as
                                          percentage of
Year           White      Non-white       rate in non-whites

1933           12.7       20.4            62
1955            8.9       12.5            71


* Rates are annual mortality rates per 1000 population, age-standardized as in
Table 9.
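The statement that the fall in mortality was somewhat greater for non-whites can be checked against the figures of Table 15. The short calculation below is only a worked illustration; it takes the four rates as they are read from the table (the 1933 white rate being read as 12.7), and the helper names are chosen for this example.

```python
# Age-standardized annual death rates per 1000, as read from Table 15.
rates = {
    1933: {"white": 12.7, "non_white": 20.4},
    1955: {"white": 8.9, "non_white": 12.5},
}

def percentage_fall(group):
    """Fall in the rate between 1933 and 1955, as a percentage of the 1933 rate."""
    earlier, later = rates[1933][group], rates[1955][group]
    return 100.0 * (earlier - later) / earlier

def white_as_percentage_of_non_white(year):
    """The last column of Table 15, recomputed from the two rates."""
    return 100.0 * rates[year]["white"] / rates[year]["non_white"]

fall_white = percentage_fall("white")
fall_non_white = percentage_fall("non_white")

print(f"fall, whites:     {fall_white:.1f} per cent")
print(f"fall, non-whites: {fall_non_white:.1f} per cent")
for year in (1933, 1955):
    print(f"{year}: white rate is "
          f"{white_as_percentage_of_non_white(year):.0f} per cent of non-white rate")
```

The recomputed percentages agree with the last column of the table, and the proportionate fall is larger for non-whites than for whites, as the text states.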


Table 16 shows rates of death assigned to selected causes in 
whites and non-whites in 1955. The over-all death rates in the 
two groups are derived from widely divergent patterns. Among 
the patterns that have stimulated the greatest epidemiologic in- 
terest are the high death rates from poliomyelitis in whites con- 
trasted with the higher rates of several other infectious dis- 
eases (tuberculosis, syphilis, whooping cough) in non-whites, the 
higher rates of hypertensive heart disease and vascular lesions of 
the central nervous system in non-whites contrasted with the 
higher rate of arteriosclerotic heart disease in whites, and the 
differences observed for cancers of various sites. In addition to 
the differences in the common sites of cancer shown in Table 16, 
cancer rates for non-whites are unusual compared to those of 
whites in showing high rates of penile cancer and low rates of 
testicular and skin cancer. A few diseases show virtual restric- 
tion to one or the other racial group. The best-known example, 
and perhaps the commonest, is sickle-cell anemia which occurs 
only in persons of Negro ancestry. 

As previously noted (page 105), sex patterns in disease occur- 
rence are different in morbidity data from those suggested by 
mortality data. Unfortunately, there have been very few data 
available on racial differences in morbidity patterns, several of the 
important morbidity studies having been restricted to one or the
other racial group in the interests of obtaining a homogeneous
study population (Collins, 1933; Downes and Collins, 1940). Also, 
data on racial differences have not yet been published from the 
U. S. National Health Survey, although the sampling procedure 
does not exclude any racial or ethnic group; no doubt this will 
eventually be a valuable source for such information. 


TABLE 16


Mortality rates* from selected causes for whites and non-whites in the
United States, 1955


                                    Mortality rates             Rate in whites as
                                                                percentage of
Cause                            White         Non-white        rate in non-whites

Poliomyelitis                      0.7            0.2             350
Suicide                           10.7            4.2             255
Leukemia                           6.8            4.1             166
Urinary cancer                 [illegible]    [illegible]         141
Arteriosclerotic heart
  disease                        256.4          190.4             135
Respiratory cancer                18.0           16.1             112
Cirrhosis of liver                10.2        [illegible]      [illegible]
Peptic ulcer                       5.3            5.4             109
Digestive cancer                  52.1           55.1              95
Motor vehicle accidents           22.9           27.3              84
Diabetes mellitus                 15.2           19.1              80
Accidents, other than
  motor vehicle                   32.8           47.6              69
Vascular lesions of the
  central nervous system         104.7          167.2              63
Genital cancer                    21.7           34.8              62
Pneumonia                         24.0           49.7              48
Hypertensive heart disease        40.2          114.1              35
Tuberculosis                   [illegible]       25.4              29
Whooping cough                     0.2            1.2              17
Syphilis                           1.6           10.2              16
Homicide                           2.3           23.8              10


* Rates are annual mortality rates per 100,000 population, standardized as in
Table 9.


Racial differences other than those between whites and Negroes 
have received less attention, mainly because comparable data on 
other racial groups have not been available. There have, how- 
ever, been special studies, for example, of the mortality from 
cancer among Japanese and Chinese groups living in the United
States (Smith, 1956a, b). Other studies of differences between
the United States and some countries of Asia with respect to
specific disease frequencies are discussed in connection with the
descriptive variable “place” (see Chapter 10).

Nativity 

When people migrate from their country of birth, patterns of 
diet, occupation, recreation and so on tend to be maintained for a 
variable length of time in the new country. Country of birth is 
therefore a valuable objective characteristic which can be used to 
distinguish within the same community groups whose environ- 
mental patterns and inherited characteristics may differ from 
those of other immigrant groups and from the nativeborn. Cer- 
tain of these environmental characteristics may be maintained 
in more or less diluted form for one or more subsequent genera- 
tions, so that nativity of parents or of grandparents may also be 
used to distinguish groups with different environmental experi- 
ences. 

In the United States, full use has not been made of the rich ma- 
terial available in this connection. In part this is due to the lack 
of sufficiently detailed population data. For example, in recent 
censuses data have not been published for small demographic 
units on the age distribution of the foreignborn population ac- 
cording to specific country of birth. This considerably limits epi- 
demiologic use of this variable, since immigration from specific 
countries to the United States tended to be concentrated in cer- 
tain periods of time and to involve particular age groups; statisti- 
cal comparisons of nativity groups that do not consider the 
question of age are usually quite meaningless. Much of the work 
in this area has therefore been done by special studies rather than 
by the collation of regularly reported data. A well-known example 
is the demonstration by Zinsser (1934) that cases of typhus occur- 
ring in New York City between 1910 and 1933 were almost exclu- 
sively limited to persons born in Eastern Europe where typhus 
was endemic. This observation led to the concept of typhus oc- 
curring as the recrudescence of a long-latent infection. 

In New Zealand it has been shown that death rates from lung 
cancer were higher among persons born in England than among 
those nativeborn. This difference suggests the possibility that 
early environment may play some role in establishing later lung 
cancer risk (Eastcott, 1956). 

Malzberg (1940) has studied rates of admission to mental 
hospitals in relation to country of birth for the population of 
New York State. Admission rates were higher among foreignborn 
than among nativeborn persons, and were higher for those born 
in Ireland or Scandinavia than for the three other large foreign- 
born groups (Italian, German, English). There were striking 
differences in this pattern between the several major diagnostic 
groups of mental illness. This study was made possible by the pro- 
vision of special census data on the age distribution of the foreign- 
born population. 

In the United States, disease rates have been compared in the 
nativeborn and foreignborn populations without respect to spe- 
cific country of birth of the foreignborn. As an indicator of en- 
vironmental experiences associated with ethnic group such 
examinations are of limited value, since the foreignborn popula- 
tion is extremely heterogeneous with respect to ethnic back- 
ground, and the total experience of the foreignborn may be 
indicative merely of the relative proportion of persons of specific 
nativity in the studied populations. However, particularly in stud- 
ies of mental disease rates, such examinations have been of value 
in suggesting the possible influences common to people who have 
changed their country of residence. This question is considered 
further in the discussion of migration as a variable per se (see 
Chapter 11). 


Religion 

Ethnic background and religious belief are so strongly corre- 
lated that religion may be used as an indicator of ethnic group. 
For example, persons of Irish and Italian ancestry in the United 
States are predominantly Catholic, and those of British and 
Scandinavian ancestry predominantly Protestant. The more re- 
cent the ancestral migration, the stronger this association. In 
most American communities, however, both Catholic and Protes- 
tant groups include such a variety of ethnic backgrounds that these 
religions may be of little practical use for identifying groups that 
have any degree of ethnic homogeneity. The Jewish religion, on 
the other hand, does appear to identify a group that, perhaps be- 
cause of a common ethnic background (in most American com- 
munities the Jewish group is predominantly Eastern European in 
ancestral background), and perhaps because of a closer influence of 
religion on everyday practices, is more homogeneous with respect 
to certain characteristics (e.g., circumcision, moderate use of alco- 
hol and tobacco, and occupation) than the population at large. 
There is evidence that, at the present time, the Jewish religion 
also identifies a greater degree of genetic homogeneity than exists 
in the American population as a whole. Thus religion is capable 
of distinguishing groups with respect to rates of certain diseases 
and characteristics known to be genetically determined — for 
example, Tay-Sachs disease and ABO blood group. It would seem 
likely that the usefulness of religion as an epidemiologic variable 
in this particular context may be expected to diminish rapidly in 
succeeding generations. 

On the other hand, some less common religious groups have 
established small social and reproductive units that show no sign 
of integration with other groups and may maintain if not increase 
their homogeneity. These groups, however, are usually of a local 
nature similar to those discussed in a later section of this chap- 
ter (page 116). 

While “religion” may be the actual characteristic used to 
identify population groups, it is clear that explanations for dif- 
ferences in disease rates may or may not be sought in some feature 
of religious practice. On the one hand, specific methods of birth 
control, circumcision, and abstinence from tobacco are features 
respectively of Catholics, Jews, and Seventh Day Adventists that 
may be thought of as part of religious practice. All have been 
put to epidemiologic use. On the other hand, prior to the study of 
Zinsser mentioned previously (page 112), typhus in New York 
City was thought of as characteristically a disease of Jews. It had 
no more to do with the Jewish religion than the circumstance 
that the majority of the immigrants from Eastern Europe to New 
York City at that particular time were Jewish; nevertheless, the 
observation that most of the affected individuals were Jewish 
led to the eventual correct explanation of the occurrence of cases 
in an area where typhus was not known to be endemic. 

A great deal of interest in religious differences has focused in 
the field of cancer, probably because of the well-known infre- 
quency of cancer of the cervix among Jewish women. This in- 
frequency has been noted among Jewish women in many different 
countries and economic groups. An examination of mortality rates 
from cancer in New York City according to religion indicated that 
cervical cancer was not the only form of cancer to show striking 
differences according to religion (Table 17). There are few ap- 
preciable differences between the rates for Catholics and Prot- 
estants evident in the table. As noted, these religions contain a 
variety of ethnic groups, and an unusual cancer experience of 
one ethnic group might not be revealed in such an examination. 
However, rates for the Jewish group show comparatively high 
levels for lymphosarcoma, leukemia, glioma, melanoma, and 
cancers of the kidney, pancreas and large intestine; and low 
rates are seen for neoplasms of the tongue, upper respiratory 
tract, esophagus and prostate, as well as for cervical cancer. 


TABLE 17 


Estimated death rates* from selected malignant neoplasms in three 
religious groups in New York City, 1955 


                        Males                               Females
Site                    Catholic    Jewish      Protestant  Catholic    Jewish      Protestant

Cervix uteri            -           -           -           20.9        8.5         [illeg.]
Tongue                  8.7         [illeg.]    8.3         -           -           -
Buccal and pharynx      15.3        [illeg.]    12.4        -           -           -
Esophagus               21.1        12.9        21.3        [illeg.]    [illeg.]    [illeg.]
Larynx                  18.0        [illeg.]    16.9        [illeg.]    -           -
Lung                    49.8        34.3        55.7        7.7         7.5         6.9
Stomach                 61.8        76.4        75.9        40.9        48.5        39.9
Rectum                  41.9        39.9        45.0        30.4        26.7        [illeg.]
Prostate                41.8        30.0        49.8        -           -           -
Bladder                 [illeg.]    23.8        30.8        10.1        7.0         13.5
Ovary                   -           -           -           22.4        30.7        [illeg.]
Lymphosarcoma           5.4         14.6        6.8         4.4         9.0         4.4
Leukemia                18.2        19.7        13.1        8.8         18.9        18.2
Glioma                  3.8         7.4         [illeg.]    3.6         5.8         3.8
Melanoma                [illeg.]    [illeg.]    [illeg.]    -           -           -
Kidney                  10.6        18.2        13.1        4.9         8.4         4.1
Pancreas                27.8        38.4        [illeg.]    18.7        26.3        19.0
Large intestine         55.9        83.8        [illeg.]    60.1        77.0        98.9

All malignant
neoplasms               585.1       594.6       638.4       437.3       510.9       467.7


* Rates are annual rates per 100,000 population aged 45 years and over. 


Local Reproductive and Social Units 


Small local populations that have maintained environmental 
and some genetic homogeneity through inbreeding have of course 
been the subject of intensive anthropologic study, but have been 
little used for epidemiologic purposes. Perhaps this relates in 
part to the difficulty of assessing frequencies of any but the most 
common diseases in groups of such small size. However, there is 
increasing interest in the addition of accurate medical data to 
anthropologic studies. Preliminary investigations of certain North 
American Indian tribes, for example, show disease patterns so 
strikingly different from those of North American whites that 
closer study seems indicated (Hadley, 1955; Smith et al., 1956). 

Within the white populations of Europe and the United States 
a few such localized groups exist. For example, there have been 
studies of specific disease rates among isolated inbreeding Alpine 
populations. Most of these, however, have been interpreted with 
particular reference to genetic hypotheses. Religious belief is 
also occasionally associated with the formation of local groups 
which maintain varying degrees of inbreeding. One of the most 
extreme of such groups is perhaps the Hutterites, who have 
maintained an almost completely isolated genetic and social unit 
for three centuries. There are only fifteen family names in the en- 
tire sect. The unique characteristics of this group have been uti- 
lized in a recent study of the frequency and nature of mental dis- 
orders among them (Eaton and Weil, 1955). 


INTERPRETATION 


Associations with descriptive variables are sought primarily for 
the purpose of developing hypotheses concerning the role of 
specific experiences in disease causation. In interpreting associa- 
tions with descriptive variables, therefore, the objective is fre- 
quently not to find the explanation of the association, but rather 
to suggest a number of different possible explanations. The 
former objective, while it might be desirable, is usually unattain- 
able. As noted in Chapter 4, substantial hypotheses rarely come 
from knowledge of an association with one variable. But, if asso- 
ciations with two or three variables are known, each one of which 
may have a number of explanations, hypotheses may frequently 
be derived by noting which explanations are common to all (or to 
most) of the noted associations. 
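
The procedure of noting which explanations recur across several 
associations can be sketched in a few lines of Python. The associations 
and candidate explanations below are hypothetical placeholders chosen 
only for illustration; they are not findings from any study cited here, 
and the function names are likewise assumptions of this sketch. 

```python
from collections import Counter

# Hypothetical example: each observed association is listed with the
# explanations that could conceivably account for it.
candidate_explanations = {
    "higher rate in males": {
        "tobacco use", "occupational exposure", "hormonal differences"},
    "higher rate in one occupational group": {
        "tobacco use", "occupational exposure", "socio-economic status"},
    "higher rate in one ethnic group": {
        "tobacco use", "dietary pattern", "genetic constitution"},
}


def explanations_common_to_all(associations):
    """Return the explanations offered for every one of the associations."""
    return set.intersection(*associations.values())


def explanation_counts(associations):
    """Count how many associations each explanation could account for."""
    counts = Counter()
    for explanations in associations.values():
        counts.update(explanations)
    return counts


common_to_all = explanations_common_to_all(candidate_explanations)
counts = explanation_counts(candidate_explanations)
common_to_most = {name for name, n in counts.items() if n >= 2}

print("Common to all: ", sorted(common_to_all))
print("Common to most:", sorted(common_to_most))
```

An explanation that survives the intersection is only a candidate 
hypothesis; it still has to be tested by the methods discussed elsewhere 
in this book. 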

The following is suggested as a simple check list to be used in 
seeking possible explanations of descriptive associations. It is a 
check list rather than a classification, since any one explanation 
may conceivably appear in several of the categories. It applies 
to most descriptive variables, although it is discussed here in 
the context of sex and ethnic group. 


(1) Errors of measurement 

(2) Differences between groups with respect to more di- 
rectly associated demographic variables 

(3) Differences in environment 

(4) Differences in bodily constitution 

(5) Differences in genetic constitution 


Errors in measurement may result in apparent associations at 
several levels. For example, there may be differences between 
groups in access to medical care and hence diagnosis. Minor ill- 
nesses may be more frequently diagnosed in certain groups of 
males than in females because of ready access to industrial 
medical care facilities. United States national statistics indicate 
lower rates of congenital malformation for Negroes than for 
whites, but a considerably greater proportion of deliveries of 
Negro infants occurs in rural areas of the South where medical 
care (and hence observation) is inadequate. Even when facilities 
are equally available to all groups, there may be differences in 
utilization. For example, women are more reluctant than men to 
undergo physical examination. Even in the same medical care 
plan, utilization rates tend to be lower for Negroes than for 
whites, and for Catholics and Protestants than for Jews (in the 
United States). 

Differences in precision of diagnosis must also be considered, 
even if access to and utilization of medical care can be considered 
equal. Thus, venereal disease is more difficult to diagnose in the 
female than in the male. Certain skin rashes may be more difficult 
to diagnose in Negroes than in whites. 
Even if rates of diagnosis are equal, reporting may vary. For 
example, in morbidity surveys in which anamnestic data are 
sought by household interview it is most commonly a female who 
is interviewed, and the reporting of data for females in such 
households is therefore likely to be more complete than that for 
males. Differences in reporting frequencies also occur between 
ethnic groups, for example, in connection with the reporting 
of minor degrees of mental disturbance. 

Errors of measurement commonly associated with the various 
sources of descriptive data have been discussed elsewhere (see 
Chapter 6). The understanding and consequent reduction of 
such errors comprises a major part of the research program of 
the U. S. National Health Survey (1958). 

Differences between groups with respect to more directly as- 
sociated variables are most obvious in relation to age differences, 
which are usually present between sex or ethnic groups. It was 
noted in Chapter 7 that age variation in disease rates is usually so 
marked that it is impossible to examine associations with other 
variables unless the data are adequate to eliminate the possible 
effect of age differences between the compared groups. This 
may be done by examination of age-specific rates for the com- 
pared groups or by the use of “standardized” rates, which are 
summary rates adjusted to allow for age differences. The methods 
of standardization are described in textbooks of biostatistics. 
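
As a rough illustration of why age must be allowed for, the sketch 
below compares two hypothetical groups that have identical age-specific 
rates but different age distributions. It uses the direct method of 
standardization, with the combined population of the two groups taken 
as the standard; that choice, the figures, and the function names are 
all assumptions made for this example only. 

```python
def rates_per_100k(deaths, population):
    """Age-specific death rates per 100,000 population."""
    return {age: 100_000 * deaths[age] / population[age] for age in population}


def crude_rate(deaths, population):
    """Crude death rate per 100,000, ignoring age."""
    return 100_000 * sum(deaths.values()) / sum(population.values())


def directly_standardized_rate(rates, standard_population):
    """Weight each age-specific rate by the standard population's age share."""
    total = sum(standard_population.values())
    weighted = sum(rates[age] * standard_population[age]
                   for age in standard_population)
    return weighted / total


# Hypothetical data: group A is older than group B.
deaths_a = {"45-54": 20, "55-64": 60, "65+": 200}
pop_a = {"45-54": 10_000, "55-64": 10_000, "65+": 10_000}
deaths_b = {"45-54": 60, "55-64": 60, "65+": 40}
pop_b = {"45-54": 30_000, "55-64": 10_000, "65+": 2_000}

# Standard population: the two groups combined (an arbitrary choice).
standard = {age: pop_a[age] + pop_b[age] for age in pop_a}

crude_a = crude_rate(deaths_a, pop_a)
crude_b = crude_rate(deaths_b, pop_b)
std_a = directly_standardized_rate(rates_per_100k(deaths_a, pop_a), standard)
std_b = directly_standardized_rate(rates_per_100k(deaths_b, pop_b), standard)

print(f"Crude rates:        A = {crude_a:.1f}, B = {crude_b:.1f}")
print(f"Standardized rates: A = {std_a:.1f}, B = {std_b:.1f}")
```

In this constructed case the crude rates differ widely while the 
standardized rates coincide, because the whole of the crude difference 
arises from the difference in age distribution. 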

In addition to age, sex and ethnic groups may differ with re- 
spect to distribution by almost any other descriptive variable. 
Thus ethnic groups in the same community often differ with re- 
spect to socio-economic status and occupation. Similar procedures 
to those used to eliminate the effect of age may be applied. 
Thus, if a difference between whites and Negroes is observed, it 
would be logical to compare whites and Negroes within restricted 
ranges of socio-economic status and/or within restricted catego- 
ries of occupation. If the differences between whites and Negroes 
persist within such restricted subgroups, it is unlikely that dif- 
ferences in socio-economic status or occupation explain the ob- 
served ethnic differences. Conversely, if a difference between oc- 
cupational or socio-economic status is observed (see Chapter 9), 
it might be desirable to determine whether these differences per- 
sist in comparisons of persons of similar ethnic backgrounds. 
Similar standardization techniques to those used for age may be 
applied to obtain summary rates after elimination of the effect of 
one or the other variable. 
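
The comparison within restricted subgroups can be sketched as follows. 
The two groups, the two socio-economic strata, and all counts are 
hypothetical, and the function names are assumptions of this example. 
The sketch computes the ratio of rates between the groups within each 
stratum and, for contrast, the ratio of the crude rates. 

```python
def rate_per_1000(cases, population):
    """Disease rate per 1,000 population."""
    return 1000 * cases / population


def stratum_rate_ratios(data, group_a, group_b):
    """Ratio of group_a's rate to group_b's rate within each stratum."""
    return {
        stratum: rate_per_1000(*groups[group_a]) / rate_per_1000(*groups[group_b])
        for stratum, groups in data.items()
    }


def crude_rate_ratio(data, group_a, group_b):
    """Ratio of the two groups' rates with all strata pooled."""
    def crude(group):
        cases = sum(groups[group][0] for groups in data.values())
        population = sum(groups[group][1] for groups in data.values())
        return rate_per_1000(cases, population)
    return crude(group_a) / crude(group_b)


# Hypothetical (cases, population) counts by stratum and group.
# Group X is concentrated in the low stratum, group Y in the high one.
data = {
    "low socio-economic status": {"X": (90, 3000), "Y": (30, 1000)},
    "high socio-economic status": {"X": (10, 1000), "Y": (30, 3000)},
}

within = stratum_rate_ratios(data, "X", "Y")
pooled = crude_rate_ratio(data, "X", "Y")

print(f"Crude rate ratio X/Y: {pooled:.2f}")
for stratum, ratio in within.items():
    print(f"Rate ratio X/Y within {stratum}: {ratio:.2f}")
```

Here the difference between the groups disappears within each stratum, 
so socio-economic status could account for the crude difference. Had 
the ratios remained well above 1 within the strata, that explanation 
would have been unlikely. 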

The desirability of identifying the descriptive variable that 
most closely localizes differences between groups in disease 
rates frequently arises in interpreting differences between ethnic 
groups. In particular, it is desirable to determine, if possible, 
whether a difference in disease incidence is associated with the 
particular circumstances under which the ethnic group is living at 
the time of observation, or whether it is a more permanent feature 
of the ethnic group. This may sometimes be accomplished by 
making the same ethnic comparison under different local condi- 
tions. For example, the fact that stomach cancer rates are much 
higher in Japan than in the United States may be attributed either 
to an innate (for example, genetically determined) susceptibil- 
ity of Japanese compared to United States residents or to some 
environmental circumstance characteristic of life in Japan. How- 
ever, Smith (1956a) has shown that while rates of stomach cancer 
for Japanese living in the United States are higher than rates for 
whites in the same country, rates for Japanese living in Hawaii 
are higher than for Japanese living in the United States and rates 
for Japanese in Japan are even higher than for those in Hawaii. 
Further, within the Japanese Hawaiians, rates of stomach cancer 
are higher in those born in Japan than in those born in Hawaii. 
This pattern suggests that the causal factor that appears to be 
characteristic of Japanese in Japan is a characteristic of a way of 
life that is relatively susceptible to change on migration and is 
certainly not of the constancy that would be associated with 
genetic determination. 

This example demonstrates how differences first identified as 
characteristic of ethnic groups may later be shown to be related to 
rather transient features of such ethnic groups. On the other 
hand, the fact that many groups of Jewish women under a wide 
variety of circumstances have equally low rates of cervical cancer 
suggests that the causal factor (or its absence) is a relatively 
stable long-term characteristic of this ethnic group. We may note 
that, even if a characteristic is a stable characteristic of a particular 
ethnic group, it is not necessarily genetic in nature. One feature 
of the Jewish group that has been offered to explain plausibly the 
unusual cervical cancer experience is the high rate of circumci- 
sion, a characteristic that can be considered genetic in only a most 
indirect way. 

Differences in environment between sex or ethnic groups may 
be exemplified by the tremendous number of disease-related ex- 
periences associated with differences in occupation and/or socio- 
economic status (see Chapter 9). 

Nutritional differences between ethnic groups have received 
considerable attention through attempts to relate levels of arterio- 
sclerotic heart disease in various ethnic groups to levels of intake 
of fat and other dietary constituents. Ethnic differences in vitamin 
deficiency diseases, notably pellagra, have also been shown to be 
due to differences between ethnic groups in dietary patterns. 
Dietary differences, of an as yet unidentified nature, are also 
suspected of being related to ethnic variation in stomach cancer. 
Nutritional differences also exist between the sexes and thus have 
a possible bearing on the higher frequency of obesity in females 
than in males. 

Ethnic differences in personal customs, habits and recreational 
patterns also are productive of environmental variation. For ex- 
ample, ethnic differences in quantity and quality of alcohol used 
are well known (at least at the anecdotal level), and differences 
in tobacco use are probably no less striking. Such differences exist 
also between males and females; they are entirely capable of ex- 
plaining the sex difference in lung cancer rates, and are suspected 
of association with sex variation in cancer of the upper respiratory 
and digestive tracts. Sexual practices, including contraception, 
show ethnic variation and may be related to ethnic differences in 
diseases of the reproductive organs. Likewise, a specified sexual 
practice or experience may affect the health of one sex differently 
than that of the other. 

Differences in bodily constitution associated with variation in 
disease rates include anatomic, physiologic, and mental (attitude) 
differences. Anatomic features are perhaps more important in 
connection with sex than with ethnic differences. Apart from the 
obvious differences in organs with specifically sexual functions, 
many more subtle anatomic differences between the sexes may 
have pertinence to differences in disease incidence. For example, 
the higher frequency of arthritis of the hip and pelvic joints in 
females has been attributed to skeletal differences and consequent 
differences in the lines of stress imposed by weight bearing. Al- 
though less common, such differences do occur between ethnic 
groups. For example, the protection against the sun’s rays con- 
ferred by skin coloring presumably is associated with differences 
noted between ethnic groups in skin cancer frequency. 

Physiologic differences between males and females are illus- 
trated by hormonal differences, suspected of association with sev- 
eral important diseases including atherosclerosis and several 
forms of neoplasm. More obviously, specific diseases of the en- 
docrine glands (e.g., the thyroid gland) are likely to be influ- 
enced by the total hormonal environment in which the gland is 
operating. Again, physiologic differences between ethnic groups are 
less obvious than those between the sexes. The level of immunity 
at the time of challenge with organisms is one type of physiologic 
characteristic which varies between ethnic groups. For example, 
the low frequency of paralytic poliomyelitis in Negroes in the 
United States is attributed to frequent early exposure (associated 
with overcrowding and other features of low socio-economic sta- 
tus) at an age when paralysis is rare, resulting in subsequent 
high levels of immunity. 

Mental or attitude differences as explanatory of differences in 
disease rates between sex and ethnic groups are most obvious 
with respect to the several forms of disability due to violence 
(Figure 15). Here again, quite different explanations may be 
sought for the existence of the attitude differences in the first 
place. Levels of disease perception, as illustrated by sex and 
ethnic differences in pain tolerance, may also be thought of as 
part of the contribution of attitude differences to disease rates. 

Differences in genetic constitution may contribute to dif- 
ferences in disease rates insofar as genetic constitution de- 
termines many of the more directly associated attributes men- 
tioned above, such as, for example, skin color or the presence or 
absence of specific sex organs. 

In addition, single major genes (that is, single genes which un- 
der suitable circumstances are associated with a demonstrable 
characteristic of the persons carrying them) may be transmitted 
in association with either sex or ethnic group. In the case of trans- 
mission of such genes in a particular sex pattern, the pattern is 
attributable to disease-associated genes being located on the same 
chromosome pair that determines sex (“sex-linkage”). Examples 
include hemophilia, certain forms of muscular dystrophy and 
red-green color blindness. Sex-linked diseases are usually readily 
identifiable when major genes of a high degree of manifestation 
are involved. 

FIGURE 15. Age-specific death rates attributed to suicide in Japan and in 
the white population of the United States, 1954 to 1956. (Data from the 
World Health Organization, 1958.) 

Ethnic concentration of genetically determined disorders, on 
the other hand, is usually due to limitation of mating to the 
particular ethnic group over many generations, and the conse- 
quent limitation to that ethnic group of genetic material which 
originally arose as new mutations in that group or was introduced 
to it by immigrants. When major genes are involved, as in the 
case of sickle-cell anemia, thalassemia and Tay-Sachs disease, ge- 
netic analysis will usually identify this type of ethnic concentra- 
tion. However, greater difficulty arises either when manifesta- 
tion of a major gene is weak and dependent on other unknown 
factors, or when many minor genes are involved, none of which 
alone has a demonstrable effect. Evidence for this type of deter- 
mination of ethnic differences is difficult to obtain. Simultaneous 
consideration should be given to three lines of evidence: collateral 
evidence of genetic determination within ethnic groups (familial 
incidence, twin studies, etc.); evidence that the characteristic 
rate (whether high or low) is a stable feature of the ethnic group 
(see page 119); and demonstration that other types of explanation 
(discussed above) are not sufficient to explain noted differences. 


9 


Other Characteristics of Persons 


At this juncture, it should be restated that the variables most 
commonly used in descriptive epidemiology are mainly the ones 
for which large quantities of data are routinely assembled by vari- 
ous agencies for their own particular purposes. In other words 
they are selected because of their ready accessibility and do not 
necessarily represent ideal types of epidemiologic information. It 
also is well to note again that those variables being discussed are 
spoken of as “characteristics of persons” only for convenience, 
and not because associations with them are deemed to reflect etio- 
logic factors intrinsic within the individual. 


OCCUPATION 


Purposes and Interpretation 


Occupation, which fortunately forms a part of many routinely 
collected records, is used as a descriptive epidemiologic variable 
for a number of rather different purposes. These include the 
following: 

(1) Measurement of social or economic status. Occupation is 
one of the basic indices of the elusive concept “socio-economic 
status.” 

(2) Identification of specific risks associated with exposure 
to noxious agents peculiar to certain occupations. This is the 
classic epidemiologic use of occupation that has resulted in the 
identification of the detrimental effects of soot, lead, benzene and 
aniline derivatives, ionizing radiation, and numerous other 
chemical and physical agents. 

(3) Measurement of the effects of different conditions of 
work. The general conditions under which an occupational group 
works, quite apart from exposure to specific physical and chemi- 
cal agents, may play a part in determining its disease experience. 
While associations dependent on such circumstances may be less 
strong than those dependent on specific chemical or physical 
risks, they may be of even greater epidemiologic interest, since 
their relevance to circumstances outside the particular occupa- 
tional group may be far reaching. 

These three types of purpose also suggest general areas 
within which speculation as to the interpretation of a noted oc- 
cupational difference in disease rates might lie. However, before 
assuming an interpretation as specifically related to occupational 
conditions as suggested by any of these three areas, one must con- 
sider the alternate types of explanation referred to in Chapter 8, 
particularly those included in “errors of measurement,” and “dif- 
ferences between groups with respect to more directly associated 
variables.” Errors of measurement may clearly influence occupa- 
tional comparisons when perception, diagnosis, or reporting of 
illnesses differ between compared groups. Differences between oc- 
cupational groups with respect to more directly associated varia- 
bles arise because of the marked influence of variables such as 
sex, age and ethnic group on choice (or availability) of occupa- 
tion. In addition to demographic variables such as these, charac- 
teristics even more directly indicative of health status, such as 
temperament, physique and physical handicaps, play a major part 
in the selection of occupation, and must be considered in the 
interpretation of statistical associations between occupation and 
health status. 


Occupation as an Index of Socio-economic Status 


By far the largest body of data relating to occupation and dis- 
ease derives from the surveys of the Registrar General for England 
and Wales, in which deaths in census and adjacent years have 
been related to the occupational distribution of the population as 
determined by the census. These surveys have been undertaken 
periodically since 1851, the latest reports being for the years 1930 
to 1932 (1938) and 1950 (1954). The reports have covered both 
occupied and retired males aged 16 and over, and beginning with 
the reports for 1930 to 1932 have also dealt with single occupied 
women and with married women classified according to the oc- 
cupations of their husbands. 

Occupations have been divided into five broad classes — Pro- 
fessional (I), Intermediate (II), Skilled (III), Partly Skilled (IV) 
and Unskilled (V). In 1950 certain subclasses of classes II, IV, 
and V were also examined. In the analysis of this material, in 
addition to deriving age-specific mortality rates in each social class, 
considerable use has been made of the standardized mortality 
ratio (the number of deaths, either total or cause-specific, in 
a given occupational group expressed as a percentage of the num- 
ber of deaths that would have been expected in that occupational 
group if the age-and-sex-specific rates in the general population 
had obtained). These ratios for death from all causes and for 
certain causes that manifest marked socio-economic gradients 
are shown in Table 18. Mortality generally increases with de- 
creasing social class, although quite divergent trends are seen for 
different diseases. The irregularity of the trends in the data for 
1950 may be because they so far represent material based only on 
a sample survey of the population and on deaths for 1 year. A 
larger study covering mortality during 5 years is planned. 


TABLE 18 


Standardized mortality ratios for males aged 20 to 64 in five occupa- 
tional classes, England and Wales* 


                      All causes                      1950
Occupational          1921-1925  1930-1932  1950      Tuberculosis  Bronchitis  Gastric   Coronary  Leukemia
(“social”) class                                                                ulcer     heart
                                                                                          disease

I (Professional)      82         90         97        64            33          56        153       150
II (Intermediate)     94         94         86        62            53          81        101       110
III (Skilled)         95         [illeg.]   [illeg.]  103           97          [illeg.]  107       104
IV (Partly skilled)   101        102        94        95            103         [illeg.]  81        79
V (Unskilled)         125        [illeg.]   [illeg.]  149           172         144       88        89


* From the Registrar General’s Decennial Supplements (1938, 1954). 

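
The standardized mortality ratio defined above can be illustrated with 
a short computation. The occupational group, its age distribution, the 
reference death rates, and the observed number of deaths are all 
hypothetical; they are not taken from the Registrar General's reports. 
Only males are considered, so that the reference rates need be specific 
for age alone. 

```python
def expected_deaths(group_population, reference_rates):
    """Deaths expected if the general population's age-specific rates applied."""
    return sum(group_population[age] * reference_rates[age]
               for age in group_population)


def standardized_mortality_ratio(observed, group_population, reference_rates):
    """Observed deaths as a percentage of expected deaths."""
    return 100 * observed / expected_deaths(group_population, reference_rates)


# Hypothetical annual death rates per person in the general male population.
reference_rates = {"20-34": 0.001, "35-49": 0.004, "50-64": 0.015}

# Hypothetical occupational group: men in each age group, and deaths observed.
group_population = {"20-34": 5000, "35-49": 4000, "50-64": 2000}
observed_deaths = 64

expected = expected_deaths(group_population, reference_rates)
smr = standardized_mortality_ratio(observed_deaths, group_population,
                                   reference_rates)

print(f"Expected deaths: {expected:.1f}")
print(f"Standardized mortality ratio: {smr:.0f}")
```

A ratio above 100 indicates more deaths than would be expected from the 
age composition of the group alone; a ratio below 100 indicates fewer. 
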
Occupation of father may also be used to classify infants ac- 
cording to socio-economic status. Infant mortality rates derived 
in this way are shown in Table 19. Infant mortality increases 
sharply with decreasing socio-economic status, and although total 
rates have fallen markedly between 1921 and 1950, the relative 
disparity between socio-economic groups has not changed. 
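
The distinction between the absolute fall in rates and the relative 
disparity between classes can be checked by simple arithmetic. The 
sketch below uses the Class I and Class V rates for 1921 and 1950 as 
printed in Table 19; the variable and function names are assumptions 
of this example. 

```python
# Infant mortality per thousand, Classes I and V, as printed in Table 19.
infant_mortality = {
    "1921": {"I": 38.4, "V": 97.0},
    "1950": {"I": 17.9, "V": 40.7},
}


def class_ratio(rates, upper="V", lower="I"):
    """Ratio of the Class V rate to the Class I rate."""
    return rates[upper] / rates[lower]


def class_difference(rates, upper="V", lower="I"):
    """Absolute excess of the Class V rate over the Class I rate."""
    return rates[upper] - rates[lower]


ratios = {year: class_ratio(r) for year, r in infant_mortality.items()}
differences = {year: class_difference(r) for year, r in infant_mortality.items()}

for year in infant_mortality:
    print(f"{year}: ratio V/I = {ratios[year]:.2f}, "
          f"difference V - I = {differences[year]:.1f} per thousand")
```

The absolute excess in Class V falls from about 59 to about 23 per 
thousand, while the ratio of the two rates changes much less, from 
roughly 2.5 to roughly 2.3. 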


TABLE 19 


Infant mortality rates per thousand related to occupational class 
of father* 


                      Aged less than 1 year           1950
Occupational
class of father       1921      1930-1932   1950      Aged under 4 weeks    Aged 4 weeks to 1 year

I (Professional)      38.4      32.7        17.9      12.9                  [illeg.]
II (Intermediate)     55.5      45.0        [illeg.]  16.2                  6.0
III (Skilled)         76.8      57.6        28.1      17.6                  10.5
IV (Partly skilled)   89.4      66.8        [illeg.]  19.8                  13.9
V (Unskilled)         97.0      [illeg.]    40.7      21.9                  18.8
Total                 79.1      61.6        24.3      18.1                  11.2


* From the Registrar General’s Decennial Supplements (1938, 1954). 


Special Risks 


Standardized mortality ratios for specific occupations have also 
been presented in the Registrar General’s Decennial Supple- 
ments, and have given valuable leads in the identification of spe- 
cific risks that might be associated with the physical and chemi- 
cal exposures of certain occupations. There have also, of course, 
been innumerable special studies of occupational risks, to the 
extent that the area now constitutes a discrete medical specialty. 
The methods are in general similar to those of cohort studies 
of other special groups exposed to supposedly noxious circum- 
stances, and are dealt with later (see Chapter 13). Hueper 
(1952) has presented a comprehensive review of neoplastic dis- 
orders known to be associated with occupation, and also (1950) 
detailed methods for occupational cancer surveys. 


Conditions of Work 


The variations in conditions of work that occur both between 
and within occupations have received increasing attention in the 
light of a number of recent studies, particularly of coronary heart 
disease and mental disorders. For example, Morris and his col- 
leagues (1953) have shown that the comparative frequencies of 
coronary heart disease in several occupations suggest lower fre- 
quencies and less severe disease in active than in sedentary occu- 
pations. Persons in different levels of authority in the same in- 
dustry have been compared with respect to hypertension and 
arteriosclerosis (Lee and Schneider, 1958) and to neurosis 
(Fraser et al., 1947). Friedman et al. (1958) have studied the 
blood cholesterol levels of accountants during periods of occupa- 
tional stress, noting significant increases during such periods. 
Cairns and Stewart (1951) have related tuberculosis rates to 
conditions of work within a section of the British shoe industry. 


SOCIO-ECONOMIC STATUS 


Measurement 


Association of disease frequency with socio-economic status, as 
with other descriptive variables, is measured by stratification of 
the population according to arbitrary levels of the variable and 
derivation of disease frequency rates in the several subgroups so 
formed, with due allowance for age, sex, and racial variations 
between the populations of the subgroups. A difficulty in this case 
is that socio-economic status is a theoretical concept still awaiting 
clear specification of meaning. So many variables, such as occupa- 
tion, family income, living conditions, social prestige, and so on, 
are encompassed that, in practice, one variable which can be objec- 
tively defined is commonly used as an indirect indicator for epi- 
demiologic purposes. Since each variable may measure a different 
component of the socio-economic complex, it is not surprising that 
associations with “socio-economic status” may differ according to 
which measure is used. It would seem more objective to describe 
such associations as associations with the particular variable used 
(e.g., income), rather than as associations with the general concept
of which the selected variable is only a part. 
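
The procedure described above (stratify, compute a rate in each stratum, allow for age) can be sketched with direct age standardization. This is a minimal illustration under stated assumptions: the counts are invented, the two age groups and the income strata are hypothetical, and direct standardization is only one of several ways of making the "due allowance" for age.

```python
def direct_standardized_rate(cases, population, standard):
    """Age-standardized rate per 1000 for one stratum.

    cases, population: counts by age group within the stratum.
    standard: counts by age group in the chosen standard population.
    """
    total_std = sum(standard.values())
    rate = 0.0
    for age, std_n in standard.items():
        age_specific = cases[age] / population[age]
        rate += age_specific * std_n / total_std
    return 1000 * rate


# Hypothetical counts, invented purely for illustration.
standard = {"0-39": 6000, "40+": 4000}
low_income = {"cases": {"0-39": 10, "40+": 60},
              "pop": {"0-39": 1000, "40+": 2000}}
high_income = {"cases": {"0-39": 10, "40+": 20},
               "pop": {"0-39": 2000, "40+": 1000}}

low_rate = direct_standardized_rate(low_income["cases"], low_income["pop"], standard)
high_rate = direct_standardized_rate(high_income["cases"], high_income["pop"], standard)
print(round(low_rate, 1), round(high_rate, 1))
```

Because both strata are weighted by the same standard age distribution, the remaining difference between the two rates cannot be attributed to the older age structure of the low-income stratum.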

Occupation. The use of occupation in this context has already 
been described. As noted, this variable can be used to stratify re- 
tired persons (on the basis of past occupation) and married 
women and children (on the basis of the occupation of the family 
wage earner), as well as the wage earners themselves. 

Family income. Income might be thought of as a more direct 
measure of the concept of socio-economic status than is occupa- 
tion. The use of this variable is limited, however, by the lack of 
availability and comparability of numerator and denominator 
data for the computation of rates. Although data on income are 
collected for the population in the decennial censuses, they are 
not so generally available for ill persons as are data on occupa- 
tion. Furthermore, even when income data are available for sick 
persons, as, for example, in hospital records, it is questionable 
whether the method of ascertainment is sufficiently similar to 
that of census data to allow meaningful comparisons. Family in- 
come has been most satisfactorily used in morbidity surveys based 
on household interviews, when data on income and other general 
characteristics can be assembled for the population at the same 
time that its health status is being ascertained. Data from the Na- 
tional Health Survey are shown in Figure 16. For both sexes (al- 
though more so for males than females) the average number of 
days of bed-disability decreases regularly and strikingly with in- 
creasing income. 

Various measures of income have been used. These include 
total family income, average family income per person in the 
household, and more complicated measures such as average fam- 
ily income per “adult unit,” children being counted as certain 
percentages of an “adult unit.” Actually, all these measures are
highly correlated and the choice of one instead of another is
usually not critical; the measurement of disease frequency is not 
so precise as to be affected by the minor variations caused by 
application of the different methods of computation of income. 
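
The three measures can be compared in a short sketch. The child weight of 0.5 is a hypothetical value chosen for illustration; the text says only that children are counted as "certain percentages" of an adult unit, and the function names are likewise assumptions.

```python
def income_per_person(family_income, n_adults, n_children):
    """Average family income per person in the household."""
    return family_income / (n_adults + n_children)


def income_per_adult_unit(family_income, n_adults, n_children, child_weight=0.5):
    """Average family income per adult unit.

    child_weight is the fraction of an adult unit assigned to each child
    (0.5 is an assumed value, not one taken from any particular study).
    """
    units = n_adults + child_weight * n_children
    return family_income / units


# Hypothetical family: two adults, two children, total income 6000.
total = 6000
per_person = income_per_person(total, 2, 2)
per_unit = income_per_adult_unit(total, 2, 2)
print(total, per_person, per_unit)
```

For families of similar composition the three figures rank households in nearly the same order, which is why the choice among them is seldom critical.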

FIGURE 16. Number of bed-disability days per person per year related to sex
and family income in the United States, July 1957 to June 1958. (Data from
the National Health Survey, 1959.) [Figure not reproduced. Vertical axis:
bed-disability days per person per year; horizontal axis: annual family
income in thousands of dollars.]

Area of residence. For most cities, census data are available
that allow classification of census tracts or other small administrative
units of the city according to some measure of socio-economic
status. Thus, census tracts may be classified according to the median
income of their populations, the per cent of the population
with incomes below a certain level, the per cent of the population 
in certain occupational categories, or measures of housing stand- 
ards such as rental or degree of crowding. Criteria of greater in- 
timacy have included the proportion of servants to those served, 
and the proportion of houses with baths. Disease frequency rates 
may be computed and compared for residents of areas classified 
according to one of these indices. Although the socio-economic 
status of the ill persons is not known individually, it is inferred 
from their area of residence. Data based on first admissions to 
mental hospitals from the City of Birmingham, England, between 
1950 and 1952 are shown in Figure 17. The several wards of the 
city were classified according to the percentage of dwellings lack- 
ing certain amenities (Lowe and Garratt, 1959). 

FIGURE 17. Average annual first admission rates to mental hospitals related
to area of residence, Birmingham, England, 1950 to 1952. (Data from Lowe
and Garratt, 1959.)


The reliability of this method varies greatly from one city to 
another and may vary also within the same city from time to 
time. While there is no question as to the tendency for the resi- 
dences of populations of low and high economic status to cluster 
in certain parts of cities, the administrative areas for which the 
necessary census data are available may not be small enough to 
discriminate clearly between such clusters. If differential disease 
rates are demonstrated by this method, then (subject to the usual 
precautions) a socio-economic association can probably be as- 
sumed. On the other hand, if no association is demonstrated, two 
alternative explanations must be considered: (1) that the dis- 
ease is not associated with socio-economic status, or (2) that the 
classification of areas is not sufficiently discriminating to detect a 
socio-economic disease association, which in fact may exist. For ex- 
ample, Martin (1955) showed that the correlation between certain 
indices of overcrowding and infant mortality in the twenty-nine
Boroughs of London was very high at the beginning of this cen- 
tury, but was virtually nonexistent in the years 1950 to 1952. 
Since other indices, such as occupation (see Table 19), suggest 
that the correlation between infant mortality and socio-economic 
status is still high, it is likely that the change noted by Martin is 
due to a change in the characteristics of the London boroughs, 
rather than in the association of infant mortality with socio-eco- 
nomic status. 

Combination of variables. Occasionally, an investigator will de- 
vise an index combining a number of variables by an arbitrary 
formula to fit in with his concept of socio-economic status. The in- 
dex might be based on a weighted combination of such items as 
income, rental, family size, occupation, education and residence. 
The latter three, for example, were used by Hollingshead and 
Redlich (1958) in their study of mental disorders. While such 
an index has usefulness in an individual study, it has the dis- 
advantage of rarely being used in its same form by other investi- 
gators, and therefore hindering comparisons of results of several 
studies. 

Other indices. Various other indices have been used and are 
appropriate for particular situations. For example, there have 
been comparisons of mortality among taxpayers and nontaxpayers 
(Chapin, 1924), and among the families of British peers and the 
general population (Bailey and Day, 1861). During the last century,
comparisons of disease frequencies in army officers and private
soldiers were popular.


Interpretation 


The many different environments experienced by persons of 
different socio-economic levels must be considered before expla- 
nations of noted variations in disease frequency are proposed. 
These include access to and use of facilities for medical care, edu- 
cation and other cultural characteristics, nutrition, crowding, and 
special circumstances related to the spread of microbiologic 
agents. An example that helped to elucidate the etiology of pel- 
lagra is shown in Table 20. 

Before one speculates on possible causal mechanisms it is necessary,
as in the case of other descriptive variables, to consider errors
of measurement, the influence within socio-economic groups
of other variables such as ethnic group, and the question whether 
occupancy of a low socio-economic group may itself have been 
determined by incipient or actual disease occurrence rather than 
the other way round. This possibility is, of course, important in 
social systems in which it is difficult to undergo long-term illness 
without bankruptcy; socio-economic associations with rates of 
prevalence or death from chronic illnesses should invariably be 
examined with this possibility in mind. Associations based on 
incidence rates will be less affected, particularly if the illness in- 
volved is one without prolonged prodromata. 


TABLE 20
Rates of pellagra related to family income in seven cotton-mill villages
of South Carolina, 1916*

Half-month                         Both sexes,                Crude rate per 1000
family income     Number of       rate per 1000
per “adult        persons       Crude   Standardized         Males      Females
unit”
< 6                 1312         42.7       41.0              30.8        54.4
6-7                 1037         26.0       24.8          [illegible]     40.7
8-9                  784         12.8       14.2              10.7        14.7
10-13                736     [illegible] [illegible]           0.0         8.0
14+                  291          3.4    [illegible]           6.2         0.0
Total               4160         23.3       29.5              14.9     [illegible]

* From Goldberger et al. (1920).


Even when costs directly associated with the illness (e.g., the 
costs of medical care) have not determined the socio-economic 
levels measured in the study, there may be difficulties of this type 
if the disease under study has prodromal or preclinical stages that 
might affect a person’s earning ability. For example, considerable 
controversy has centered around the question of whether the 
higher frequencies of diagnosed mental disorders that have been 
observed in lower socio-economic groups truly result from a
higher frequency of occurrence among these groups or from a 
social “drift” that might be expected of persons with incipient 
undiagnosed mental disorders. There is no ready solution to this 
question, although with a decreasing interval between the observation
of the difference and the onset of manifest incapacity the
uncertainty should correspondingly become less. 


MARITAL STATUS 


General Mortality 


Unlike socio-economic status, marital status is a variable which 
is easy to measure; ready stratification of the population may be 
made into four major groups — single, married, widowed and 
divorced. The problem of ensuring similar definition of these 
terms in both the affected individuals and the population to 
which they are referred for the derivation of rates is an important 
though not difficult one. 

While marital status appears routinely on death certificates and 
in census records, data on deaths in relation to marital status are 
not published routinely in this country. They have, however, been 
the subject of special reports of the National Office of Vital Statis- 
tics for the years surrounding the censuses of 1940 and 1950 
(1945, 1956). Age-standardized rates for 1949 to 1951 are shown 
in Table 21. They demonstrate a characteristic relationship: the
lowest mortality is for married persons and the highest for the
widowed and divorced. This relationship is seen for both sexes.


TABLE 21
Mortality rates* related to marital status in the white population aged
20 or more, United States, 1949-1951†

Marital status        Total        Male       Female
Single                 15.6        20.6        10.8
Married                11.3        13.1         8.7
Widowed                14.2        21.4     [illegible]
Divorced           [illegible]     26.7        13.7
Total                  12.6        15.1        12.4

* Rates are average annual mortality rates per 1000 population. Standardization
is to the age distribution of the total United States population aged 20 and over
in 1950.
† Data from National Office of Vital Statistics (1956).
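
The size of the excess is easier to judge when each rate is expressed relative to the rate for married persons. The sketch below does this for the male column of Table 21; the names `male_rates` and `ratio_to_married` are illustrative assumptions rather than anything used in the source report.

```python
# Age-standardized male mortality per 1000, as printed in Table 21.
male_rates = {"Single": 20.6, "Married": 13.1, "Widowed": 21.4, "Divorced": 26.7}


def ratio_to_married(rates):
    """Each rate divided by the rate for married persons."""
    base = rates["Married"]
    return {status: rate / base for status, rate in rates.items()}


male_ratios = ratio_to_married(male_rates)
for status, value in male_ratios.items():
    print(f"{status:9s} {value:.2f}")
```

On these figures single and widowed men have rates roughly one and a half times those of married men, and divorced men about twice.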


Table 22, also calculated from age-specific rates given in the 
1950 report, shows that the relationship between the rates for
single and married persons is similar for most of the major causes 
of death. 

TABLE 22
Mortality rates* from selected causes in single and married white
persons aged 20 and over, United States, 1949-1951†

                                         Males                    Females
Cause of death                     Single     Married       Single     Married
Respiratory tuberculosis             96.7   [illegible]       25.0       11.4
Tuberculosis, other forms             3.4       1.5            1.8        0.8
Syphilis                             14.8       6.6            2.0    [illegible]
Malignant neoplasms:
  Total                             260.0     207.1          470.4      182.5
  Buccal and pharynx                 13.7       7.0            1.6        1.8
  Digestive organs                  115.4      87.6           73.0       65.5
  Respiratory system                 44.5      35.5            7.8    [illegible]
  Breast                              0.7       0.4           54.0       35.2
  Genital organs                     25.7      25.0           40.0       38.2
  Urinary system                 [illegible]   13.0            6.6        5.8
  Leukemia                            8.0       8.8            5.6        5.8
Diabetes mellitus                    21.9      18.3           16.5       29.0
Vascular lesions of C.N.S.          182.1     138.9          136.6      124.7
Rheumatic heart disease              28.7      18.6           23.8    [illegible]
Arteriosclerotic heart disease      525.6     409.9          212.0      179.1
Chronic nephritis                    33.0      23.1           20.2       18.4
Influenza and pneumonia          [illegible]   27.0           34.1       18.3
Peptic ulcer                         21.5      11.8            3.8    [illegible]
Motor vehicle accidents              61.9   [illegible]       11.0       10.8
Other accidents                      99.2      46.1           36.4       21.1
Suicide                              43.7      20.5            7.7        6.2

* Rates are average annual rates per 100,000 population, standardized as in
Table 21.
† Data from National Office of Vital Statistics (1956).


Three major explanations of such a relationship are: 

(1) Persons in poor health, or in the presymptomatic stages 
of ill health, tend to remain single. 

(2) Persons who “live dangerously,” and are consequently 
exposed to a wide variety of disease-producing agents and situa- 
tions, tend to remain single. 

(3) There are differences in the ways of life of single and 
married persons that result from the presence or absence of the 
married state and that are causally related to certain diseases. 

It is likely that all three of these explanations play a role in
some of the noted relationships. Studies of cancer and mental dis- 
orders serve to illustrate some of the possibilities. 


Cancer 


Table 22 shows that a number of sites of cancer exhibit fre- 
quency differences between married and single persons, but in- 
terest so far has largely focused on cancers of those organs of the 
female most directly involved in reproductive and other marital 
functions. It is desirable, if possible, to separate the effects of re- 
production or fertility from other concomitants of the married 
state. If the data are available, this can be done by comparing 
single women with infertile married women. An example is 
shown in Figure 18. It is seen that for carcinoma of the cervix 
rates for married infertile women are almost twice as high as 
those for single women. Carcinoma of the corpus uteri, on the 
other hand, appears about equally frequent in these two groups, 
although married women as a whole would show lower rates than 
single women because of the lower rates evident for fertile 
women. Similarly, it has been shown that married women have 
lower rates of breast cancer than do single women; this in part 
can be attributed to a lower rate of breast cancer among women 
who have borne children, but it is also evident when single and 
married infertile women are compared. 

It is likely that the associations of carcinoma of the cervix and 
of the breast with marital status are both explicable in terms of 
marital function. That cervical cancer is extremely rare in nuns 
and that rates show an inverse relationship with age at marriage, 
independent of fertility, suggest that frequency of sexual inter- 
course may be the variable responsible for the descriptive associa- 
tion with marital status. Lilienfeld (1956) has shown that artifi- 
cial menopause confers some degree of protection against breast 
cancer, and that married women more frequently have the meno- 
pause induced artificially than do single women. This relation- 
ship is also independent of differences in fertility, and may ex- 
plain the difference between single and married infertile women 
in the frequency of breast cancer (MacMahon and Feinleib, 
1960). 

FIGURE 18. Equivalent average death rates from cancer of the uterus related
to marital status and fertility in England and Wales, 1948 to 1949. An equivalent
average death rate is the sum of the age-specific death rates divided by
the number of age groups summed. It is the equivalent of standardization to
a standard population which has equal numbers in each age group. In the
present case, data are restricted to females aged 15 or more. (From Logan,
1953.)
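
The definition of the equivalent average death rate given in the legend to Figure 18 can be verified numerically. In the sketch below the age-specific rates are hypothetical values chosen for illustration, and the function names are assumptions; the point is only that the simple mean of the age-specific rates equals the directly standardized rate when the standard population has equal numbers in every age group.

```python
def equivalent_average_rate(age_specific_rates):
    """Sum of the age-specific rates divided by the number of age groups."""
    return sum(age_specific_rates) / len(age_specific_rates)


def directly_standardized_rate(age_specific_rates, standard_pop):
    """Age-specific rates weighted by the standard population's age distribution."""
    total = sum(standard_pop)
    weighted = sum(r * n for r, n in zip(age_specific_rates, standard_pop))
    return weighted / total


# Hypothetical age-specific death rates per 100,000 (three age groups).
rates = [2.0, 10.0, 30.0]
equal_standard = [1000, 1000, 1000]  # equal numbers in each age group

ead = equivalent_average_rate(rates)
std = directly_standardized_rate(rates, equal_standard)
print(ead, std)
```

With any other standard population the two figures would generally differ, since older age groups would no longer carry the same weight as younger ones.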


Mental Disease 


Point prevalence rates of persons in mental hospitals in the 
United States in 1950 are shown in Figure 19. The most striking 
difference is between the rates for single and married persons. 
In addition to the magnitude of the difference, there is an inter- 
esting reversal of the sex pattern between the two marital status 
groups. Among single persons, rates for males are higher than for
females, yet among married persons the converse is true for all
age groups except the youngest (14-24 years). Similar relationships
can be demonstrated in incidence (first admission) data.

FIGURE 19. Point prevalence of patients in mental hospitals per 100,000 population
according to age, sex and marital status, United States, 1950. (Data
from U.S. Bureau of the Census, 1953b.)

These relationships have a variety of possible explanations, as 
previously suggested. If the explanation is based on the premise
that persons with manifest or incipient mental disorders tend to
remain single, then either the effect of this selection must be
greater for males than for females, or there must be additional etiologic
influences which affect single males and married females. An
observation that is difficult to explain on the basis of selection of 
marital status is the generally higher rates of mental disorders for 
widowed than for married persons, particularly in the younger 
age groups. The most likely explanation of this relationship 
seems to be that the stress of bereavement or of widowhood has 
a causal relationship to the manifestation of mental disorder. 

Still a further explanation of the difference between the rates 
for married and single persons has to do with the use of mental 
hospital data as estimates of the frequency of mental disorder. It 
is possible that the actual frequencies of disorder are the same 
but that single persons are hospitalized more frequently because 
of the absence of alternative care. Here again, the high rates for 
widowed persons do not seem to conform to such a hypothesis, 
since it is difficult to believe that alternative accommodation 
would be less readily available to young than to old widows and 
widowers. 


Concordance between Marital Partners 


Another area of investigation deals with the possible influence 
of a common environment on the health of marital partners, 
studies being conducted to determine whether husband and wife 
tend to have the same disease experiences. Ciocco (1940) found 
among the spouses of persons who died from tuberculosis, pneu- 
monia, heart disease or cancer a higher than expected number 
who died of the same condition. In morbidity data, Downes 
(1947) found that pairs of husbands and wives both had a chronic 
disease significantly more frequently than would be expected by 
chance; there was a significant association of the same illness in 
both spouses for hypertensive vascular disease and arthritis. 
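
A comparison of observed with expected concordance can be sketched as follows. The sketch assumes the simplest definition of "expected by chance", namely that disease in the husband is independent of disease in the wife; the counts are hypothetical, and the cited studies may have allowed for age and other factors in ways not shown here.

```python
def expected_concordant(n_couples, husbands_affected, wives_affected):
    """Couples expected to have both spouses affected if the husband's
    and wife's disease status were independent."""
    return husbands_affected * wives_affected / n_couples


# Hypothetical survey of 1000 couples.
n_couples = 1000
husbands_affected = 100
wives_affected = 80
observed_both = 20

expected_both = expected_concordant(n_couples, husbands_affected, wives_affected)
ratio = observed_both / expected_both
print(expected_both, ratio)
```

A ratio well above 1 points to an excess of concordant couples, which may then be examined for common environment or for selection of partners.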

Such studies are useful in pointing to disorders in which causal 
environmental circumstances may be sought in experiences that
are common to husbands and wives. The possibility that marital 
partners are consciously or unconsciously selected with respect to 
characteristics that will determine the development of similar dis- 
eases (e.g., ethnic group, or physical attributes) must of course be 
given consideration. 


FERTILITY 


Rates for a few disorders among females have been related to 
the number of children borne. Disorders of pregnancy provide 
the most obvious of such relationships. Toxemia, placenta previa 
and uterine efficiency, for example, show marked associations 
with maternal parity. Mechanical disorders of the reproductive 
tract, such as uterine prolapse, also show relationships to maternal 
fertility. 

Examples of a less immediate relationship are that rates of 
cervical cancer increase and that those of breast cancer decrease 
with increasing parity. Wynder (1955) and others have suggested 
that the association of cervical cancer with parity may be sec- 
ondary to associations of each with age at first marriage, but no 
satisfactory explanation of the inverse relationship between 
breast cancer rates and parity has been substantiated. 

Although there has been some work on time of occurrence of 
psychosis in relation to childbirth (see Chapter 11), no data are
available on the risk of mental disorders in relation to number 
of children borne. With the great amount of information on the 
fertility of the female population which is collected by the cen- 
suses of most countries, it is surprising that epidemiologic use of 
this descriptive variable has been virtually restricted to studies of 
uterine and breast cancer. In addition, if there were census data 
available for male fertility, it would seem reasonable to examine 
male disease patterns with respect to fertility. 


10 
Place 


The importance of variation in the frequency of different dis- 
ease manifestations from place to place has long been recognized. 
It has formed the basis of much reasoning leading to hypotheses 
of etiology. In addition, knowledge of the geographic distribution 
of disease has obvious utility for administrative purposes. 


METHODS 


Early knowledge of the geographic distribution of disease was 
entirely qualitative, and derived from the sporadically published 
observations of medical practitioners. Hirsch’s three-volume 
Handbook of Geographical and Historical Pathology, translated
into English in 1883, is a gargantuan compendium of such observations
and is regarded by many as the beginning of modern
epidemiology. Because of its reliance on qualitative data, perhaps 
it should instead be regarded as the full flowering of anecdotal 
epidemiology rather than as the beginning of a new approach. 

The quantitative knowledge of present-day epidemiology is 
based in part on special surveys and in part on analysis of rou- 
tinely collected and published vital data. The use of routinely 
collected statistics of causes of death and notifications of infectious 
diseases has the advantage of allowing exploratory examinations 
of a large number of diseases to determine the likelihood of significant
geographic variation. When probable differences are
demonstrated, special confirmatory surveys may be instituted. 
However, special surveys are expensive and unlikely to be under- 
taken unless routine data suggest specific hypotheses which need 
testing. 

On the other hand, routine data have a number of serious dis- 
advantages. First, there are the difficulties consequent on the 
variations that exist from place to place in standards of medical 
care, diagnosis and reporting of illness or death. Second, there is 
the virtual restriction of such data to infectious diseases and dis- 
orders reported as causes of death. Third, the limits or regions for 
which the data are reported are usually administrative, and the 
geographic factors influential as determinants of disease frequency 
may not be so accommodating as to approximate administrative 
boundaries; consequently, failure (with routinely available data) 
to demonstrate geographic differences in the distribution of a 
disease does not rule out the possibility that place may indeed 
play a role in its occurrence. 

According to the scale of the map, we may distinguish several 
types of comparison that are commonly undertaken. It should be 
clear, of course, that these comparisons must be considered in the 
light of the effects of differences in age and sex composition be- 
tween the compared populations. 


International Comparisons 


Data from many nations relating to causes of death and noti- 
fications of infectious diseases are currently collected and periodi- 
cally published by the World Health Organization. Although the 
published data are restricted to those countries from which re- 
porting is believed to be most adequate, the difficulties associated 
with differences in diagnosis and reporting are more serious in 
comparisons of data between nations than those associated with 
most other geographic data. Uncertainty whether even gross dif- 
ferences may be due to such artifacts is felt particularly when 
they are found in areas outside of Europe, North America and 
Australia. Nevertheless, the gross differences that have been dem- 
onstrated, and even those that are less striking, should not be 
written off as inconsequential simply because of this uncertainty. 
They will no doubt be susceptible to increasingly critical examination
as reporting procedures improve.

Most striking, of course, are differences between tropical and 
temperate areas in the frequency of major parasitic diseases. In 
such situations, where both clinical opinion and statistical mate- 
rial indicate that certain diseases that are epidemic in particular 
areas are rare in others, there can be little doubt that the differ- 
ences are real. Less obviously real, and therefore all the more 
provocative, are the unusual distributions observed for certain 
noninfectious diseases. Japan, for example, with relatively low 
rates of death from coronary artery disease, has very high rates 
of hypertension and cerebrovascular accidents; unusually high 
death rates from stomach cancer exist in Germany and Iceland, 
as well as in Japan; Great Britain is worst off with respect to 
lung cancer, although the United States, while not in quite the 
same class, is not very far behind. If the United States needs 
honors it can get them in the fields of coronary artery disease and 
leukemia, although for the latter the distinction must be shared 
with Denmark and Sweden. Perhaps the main value of statistical 
data such as these is to provide a mechanism for the broad classi- 
fication of countries into those with probably high, probably 
average and probably low rates of an individual disease; the data 
appear too fragile to undergo a finer classification at the present 
time. Nonetheless, they do constitute the most important source 
of information on international variation in disease frequency 
available at the present time and deserve serious attention. 

Special studies of international variation are well illustrated by 
those in which the purpose has been to correlate coronary artery 
disease frequencies with the dietary constituents in various parts 
of the world. Special clinical studies have been made among 
groups of people in areas selected because of observed dietary
patterns (see, for example, Scrimshaw et al., 1957). Valuable in- 
formation on the distribution of atherosclerosis and on the levels 
of serum cholesterol has been obtained, although there is still 
no consensus as to which dietary constituents are most closely 
correlated with the variation in coronary disease frequency 
(Yerushalmy and Hilleboe, 1957). Particularly interesting in 
such studies are attempts to rule out ethnic differences as being 
explanatory of observed geographic differences. For example, 
Miller and his colleagues (1958) found that the serum cholesterol
levels of men whose parents were born near Naples but who
themselves had lived all their adult lives in the United States, 
while comparable to those of other Americans, were significantly 
higher than those of native Neapolitans examined by Keys et al. 
(1954). 

In many international studies of this type, information both 
on the collateral variables (such as diet) and on health and dis- 
ease characteristics (such as cholesterol level) is obtained from 
the same persons. In this type of study the fact that the group ob- 
served may not be representative of the population residing in 
the area is not crucial. However, if disease rates derived from 
such special study groups are to be compared with characteristics 
which are representative of the population of the area, the selec- 
tion of a representative sample in the special study group does 
become critical and considerably increases the technical difficul- 
ties which must be faced. 


Variation Within Countries 


Just as death statistics and notifications of infectious diseases 
constitute the major source of knowledge of variation in disease 
frequency between countries, so they are the major sources of 
data for comparisons within countries. Again, use must be made of 
the administrative areas for which deaths and population bases 
are reported. In general, the smaller (geographically) the admin- 
istrative areas examined, the more accurately the boundaries of 
areas of high or low disease frequency can be drawn, provided 
that the individual areas have populations large enough to yield 
reliable estimates of disease frequency. 
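
The proviso about population size can be made concrete. The sketch below assumes that the number of deaths in an area behaves as a Poisson count, so that the relative standard error of the rate is approximately one over the square root of the number of deaths; the two areas and their figures are hypothetical.

```python
import math


def rate_and_relative_se(deaths, population, per=100_000):
    """Death rate per `per` population and its approximate relative
    standard error, assuming the death count is Poisson distributed."""
    rate = per * deaths / population
    relative_se = 1 / math.sqrt(deaths)
    return rate, relative_se


# Hypothetical areas with the same underlying rate (8 per 100,000).
state_rate, state_rse = rate_and_relative_se(400, 5_000_000)
county_rate, county_rse = rate_and_relative_se(4, 50_000)
print(state_rate, state_rse)
print(county_rate, county_rse)
```

The two areas show the same rate, but the estimate for the small area carries a relative standard error of about 50 per cent against 5 per cent for the large one, so that apparent local peaks based on a handful of deaths should be read with caution.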

For example, Figure 20 shows standardized mortality ratios for
deaths from leukemia in each state of the United States, while 
Figure 21 shows data calculated for individual “economic sub- 
regions,” the latter being smaller population groups that incor- 
porate several contiguous counties within states. At a distance, 
the impression obtained from the two figures is similar — both, 
for example, indicate low frequencies in the Southeast and a 
pocket of high frequency in the North Central part of the coun- 
try. However, when examined more closely the maps show some 
significant dissimilarities. For example, Figure 20 suggests that 
rates are very high for Minnesota and that an area of high rates
extends across Wisconsin to the shore of Lake Michigan. But in 
Figure 21 it is seen that the highest frequencies actually are found 
in Western Minnesota, extending into parts of the Dakotas. Simi- 
larly, Figure 21 shows that the high rate for New York State com- 
pared with surrounding states (seen in Figure 20) is in fact a 
characteristic of the New York Metropolitan Area, but not of up- 
state New York. 

A detailed geographic delineation of disease frequencies may 
not always be required. There may, for example, be interest 
primarily in the question of whether there is a difference in fre- 
quency between the northern and the southern parts of a coun- 
try. Even so, an examination of data for small administrative 
units will help to clarify whether a difference noted between the 
north and the south (when examined as two large units) reflects 
a difference of the areas in general or is due to the presence 
within one or the other of geographic localizations of very high 
or low frequency. The presence of a few pockets of high fre- 
quency in one-half of a country would obviously require a dif- 
ferent interpretation than the finding of a widespread higher fre- 
quency in one-half compared to the other, the latter finding, for 
example, being more meaningful in terms of the dichotomy 
north and south. 

Classification of places within a country need not be made on a 
spatially contiguous basis. For example, it may be desirable to 
compare counties or states having large agricultural populations 
with those having small farming populations. The same possi- 
bility — that one or the other category might show a high disease 
frequency rate because it includes one or two areas of very high 
frequency (because of an unsuspected reason) — can be examined 
by comparison of areas classified according to levels of the rele- 
vant variable rather than according to a simple geographic di- 
chotomy, or by use of a two-way table arranging varieties of areas 
into those having rates above and below average levels of disease 
frequency. 
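
The two-way table described above might be sketched as follows. The area names, categories and rates are hypothetical, and "average" is taken here as the unweighted mean of the area rates, which is an assumption and not a rule stated in the text.

```python
from collections import Counter


def two_way_table(areas):
    """Cross-classify areas by category and by rate above or below the mean.

    `areas` is a list of (name, category, rate) tuples.
    """
    mean_rate = sum(rate for _, _, rate in areas) / len(areas)
    table = Counter()
    for _, category, rate in areas:
        level = "above average" if rate > mean_rate else "at or below average"
        table[(category, level)] += 1
    return mean_rate, table


# Hypothetical areas and rates (per 100,000), invented for illustration only.
areas = [
    ("A", "agricultural", 12.0),
    ("B", "agricultural", 3.0),
    ("C", "agricultural", 4.0),
    ("D", "non-agricultural", 5.0),
    ("E", "non-agricultural", 5.5),
]

mean_rate, table = two_way_table(areas)
for (category, level), count in sorted(table.items()):
    print(f"{category:18s} {level:20s} {count}")
```

In this invented example the agricultural category has the higher mean rate, yet only one of its three areas lies above the overall average. The table thus exposes a single pocket of high frequency that a comparison of category means would conceal.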

The examination of a geographic variation may go beyond a 
description of the levels of variation observed to the development 
and exploration of hypotheses attempting to explain the noted 
variation in terms of other characteristics known about the regions. 
This further exploration may involve comparison of the disease
frequencies with the frequencies of those characteristics suspected
of being relevant, by means of correlation scatter diagrams or by
more formal methods of mathematical correlation. An example
(Figure 22) is to be found in the work of Dean (1946) and
others in the identification of a major determinant of the
geographic variation in frequency of dental caries: the fluoride
content of drinking water.

Figure 22. Fluoride content (parts per million) of drinking water correlated
with dental caries experience and with evidence of any degree of dental
fluorosis (including “very mild”) in white school children aged 12 to 14 years
in twenty-one cities of four states. (Data from Dean, 1946.)
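
As a minimal sketch of the "more formal methods of mathematical correlation" mentioned above, the product-moment (Pearson) coefficient can be computed for paired area-level values. The figures below are hypothetical and are not Dean's data; the choice of this particular coefficient is likewise an assumption.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation coefficient for paired observations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical values for five cities, invented for illustration only.
fluoride_ppm = [0.1, 0.5, 1.0, 1.5, 2.5]
caries_per_100_children = [900, 700, 450, 300, 250]

r = pearson_r(fluoride_ppm, caries_per_100_children)
print(f"r = {r:.2f}")
```

A strongly negative coefficient of this kind summarizes what a scatter diagram shows by eye. Like the diagram, it describes areas, not individuals, and does not by itself establish cause.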


Such information of course is not derived from routinely pro- 
cessed data but from special surveys designed to examine par- 
ticular hypotheses. Special surveys both of animals and of man 
are frequently undertaken to develop an understanding of the 
distribution of infectious diseases. In man such surveys may
involve search for existent clinical cases of the disease. Frequently,
however, they are concerned mainly with evidences of residua 
of infections — for example, the presence of antibodies in the 
serum, trachoma scars or splenic enlargement. Clearly, examina- 
tion of populations for scars and other leftovers from past infec- 
tions is more rewarding from the standpoint of numbers found 
affected than is a search for current cases. It should be noted, how- 
ever, that the information derived refers to a peculiar form of 
point prevalence — the frequency of having had and survived a 
past infection. The evaluation of such a rate is necessarily difficult. 

A social phenomenon that involved the clinical examination of 
certain age groups of men from all over the United States and 
was consequently of great convenience to the geographically 
minded epidemiologist, if to nobody else, was the mobilization of 
the Armed Forces during World War II. For example, several 
studies of the prevalence of dental caries in relation to area of 
residence of inductees were undertaken, a summary of which is 
shown in Figure 23. Similarly, Gentry et al. (1955) used data 
from the same source to examine the area of residence of induc- 
tees with sarcoidosis. The latter study provides a good example 
of the relationship between ethnic group and geography. Since 
sarcoidosis occurs much more frequently among Negroes, the 
finding that most cases came from the southeastern part of the 
United States at first sight was not surprising. However, even
when allowance was made for the geographic distribution of the
Negro population, there still appeared to be a geographic
concentration of the disease in this area and in certain smaller
areas within it.


Urban-Rural Comparisons 


A special type of intracountry comparison that does not involve 
classification into contiguous areas is the examination of differ- 
ences in disease frequency according to degree of urbanization. In- 
creasing use of this variable has demonstrated that economic and 
industrial development is not an entirely unmitigated blessing. 

Both the U. S. National Office of Vital Statistics and the Regis- 
trar General for England and Wales publish tabulations of cause 
of death according to urban or rural residence. In the United 
States the division is along county lines, counties being classified 
according to whether they lie within the Standard Metropolitan
Areas established by the Bureau of the Census (“metropolitan
counties”), or not (“non-metropolitan counties”). A special
tabulation of deaths for the years 1949 to 1951, undertaken in
connection with the Air Pollution Medical Program of the U. S. Public
Health Service, provides age-standardized death rates for 102 
causes of death in 163 Standard Metropolitan Areas (Manos, 
1957). In addition to allowing comparison of the metropolitan 
areas with the rest of the country, these data allow comparisons of 
the individual metropolitan areas. A distinction is also drawn be- 
tween counties with a central city in a Standard Metropolitan 
Area and counties that are peripheral to a large city although 
still included in a Standard Metropolitan Area. The latter are 
presumably less urbanized than the former. 

Data for England and Wales published by the Registrar Gen- 
eral provide even more detail for study of urban-rural differences. 
Tabulations of deaths by cause, age and sex are given for six
large “conurbations” (coalescences of major towns), urban areas
outside the conurbations divided into three groups according to 
population size, and rural areas. 

Table 23 illustrates urban-rural comparisons for selected causes 
of death as derived from the data for 163 Metropolitan Areas in 
the United States. Death rates are generally higher for urban 
than for rural areas, both for all causes of death and for many 
individual causes. The data for deaths from “symptoms, senility 
and ill-defined conditions” indicate that diagnostic and reporting 
difficulties may enter into these comparisons, although, at least 
in the United States and Great Britain, the effect of such possible 
differences is small. The most striking excess rates in urban, by
comparison with rural, areas are seen for arteriosclerotic heart
disease, tuberculosis and cirrhosis of the liver. The excess
death rates from nephritis and vascular lesions of the central
nervous system in non-metropolitan areas, although small, are of
interest because they are contrary to the general trend.

Urban-rural comparisons are also frequently made during the 
course of special studies. For example, in the infectious diseases, 
inferences regarding mechanisms of spread have been made from 
comparisons of urban and rural areas with respect to frequency, 
age distribution and rapidity of spread of the disease. Diseases 
spread by contact and respiratory infections are acquired later in
life and often less frequently in the country than in the city. On 
the other hand, intestinal infections and many diseases transmit- 
ted from animals to man have greater opportunity to affect hu- 
man beings in rural areas. 

Results of two cancer morbidity studies, one a special survey 
in Iowa and the other of data from the Danish Cancer Registry, 
are shown in Table 24. Minor differences in this table must be 
interpreted with caution, since, as noted, diagnostic standards 
may well be somewhat higher in urban than in rural areas, even 
within these two limited geographic regions. However, strikingly 
higher rates for urban areas are seen in both studies for cancers 
of the pharynx, esophagus, lung, bladder and cervix. 


TABLE 24

Urban-rural differences in incidence* of cancer of selected sites,
Denmark, 1948-1952, and Iowa, 1950†

                    Denmark                                Iowa
Site and sex        Copenhagen   Provincial   Rural        Urban        Rural
                                 towns        areas        area         areas

Lip (M)             [illegible]  4.7          8.1          17.3         [illegible]
Tongue (M)          1.4          1.0          0.3          1.6          1.1
Pharynx (M)         1.9          0.8          0.3          4.3          1.5
Esophagus (M)       8.0          5.1          3.8          3.9          [illegible]
Stomach (M)         37.0         43.4         41.8         [illegible]  [illegible]
Stomach (F)         23.6         26.7         29.8         15.1         14.1
Rectum (M)          [illegible]  [illegible]  [illegible]  16.3         [illegible]
Rectum (F)          13.9         [illegible]  11.2         10.0         11.0
Lung (M)            43.6         15.3         8.3          29.0         10.2
Kidney (M)          8.8          4.4          3.7          4.8          3.5
Kidney (F)          [illegible]  3.0          3.4          5.3          2.8
Bladder (M)         [illegible]  7.7          5.6          20.2         [illegible]
Bladder (F)         4.1          [illegible]  [illegible]  6.6          3.2
Leukemia‡ (M)       8.4          7.0          6.6          14.4         11.4
Leukemia‡ (F)       6.0          5.0          4.5          10.9         7.2
Cervix (F)          38.4         [illegible]  20.2         45.4         23.6
Corpus uteri (F)    14.4         10.9         7.9          11.0         10.9
Breast (F)          57.4         [illegible]  40.9         78.0         62.4
All sites (M)       284.8        [illegible]  185.7        350.6        250.2
All sites (F)       [illegible]  [illegible]  208.6        351.6        261.5


* Rates are annual incidence rates per 100,000 standardized to the age
distribution of the total United States population in 1950.

† From Clemmesen and Nielsen (1957) and Haenszel et al. (1956).

‡ For Iowa, these rates include other cancers of the hemopoietic system.




Local Distributions 


Spot maps showing the distribution of cases of a disease within 
the local community comprise a long-established epidemiologic 
procedure. A classic example, Snow's (1855) investigation of the
outbreak of cholera in the Golden Square area of London in
August and September of 1854, is illustrated in Figure 24. The
identification by Snow of the place where each of the victims of this
outbreak was attacked (i.e., became frankly ill) enabled him to 
center attention on the common pump in Broad Street (A). He 
noted that the outbreak centered geographically around this 
pump and that the number of cases was small in areas that were
closer to some other source of water supply. The localization of
cases around the Broad Street pump becomes still more striking
when it is noted that (1) although there are several cases in the
vicinity of the pump in Marlborough Street (B), the water from
this pump, being heavily and obviously polluted, was avoided by
residents of this area, who preferred to walk the additional
distance to the pump in Broad Street; and (2) the pump in Rupert
Street (C), while apparently close to several cases as the crow flies,
was not close as the water-carrier travels because of the
conformation of the streets in the area.

Figure 24. Street map of the subdistricts of Golden Square, St. James and
Berwick St., London, showing the distribution of cholera cases in the outbreak
of August and September, 1854. (From Snow, 1855.)

While no formal reference to any population at risk is usually 
included in such a map, knowledge of the distribution of the 
population is essential to interpretation. An area of the map may 
be empty of cases merely because it is not populated. For example, 
the 5 cases shown in the workhouse on Snow’s map mean little 
unless it is known that the institution had 535 inhabitants. By 
contrast, a percussion-cap factory in Broad Street had 18 cases 
among approximately 200 workers. The workers in the factory 
used water from the Broad Street pump, but the workhouse had 
its own well on the grounds. Similarly, the absence of cases among 
workers in the brewery in Broad Street becomes of significance 
when it is known that 70 men were employed there. Dr. Snow, 
inquiring of Mr. Huggins, the proprietor, ascertained that: “The 
men are allowed a certain quantity of malt liquor, and Mr. Hug- 
gins believes they do not drink water at all; and he is quite cer- 
tain that the workmen never obtained water from the pump in 
the street. There is a deep well in the brewery.” 
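
The point about populations at risk can be made concrete by converting the counts quoted above into attack rates. This is a minimal sketch: the factory workforce is given only as "approximately 200", so that rate is approximate, and the function and variable names are illustrative.

```python
def attack_rate_percent(cases, persons_at_risk):
    """Cases per 100 persons at risk."""
    return 100.0 * cases / persons_at_risk


# (cases, persons at risk) as quoted in the text from Snow's account.
# The factory denominator is stated only as "approximately 200".
groups = {
    "workhouse": (5, 535),
    "percussion-cap factory": (18, 200),
    "brewery": (0, 70),
}

rates = {name: attack_rate_percent(cases, at_risk)
         for name, (cases, at_risk) in groups.items()}

for name, rate in rates.items():
    print(f"{name:24s} {rate:5.1f} per 100")
```

Expressed this way, the factory, which drew on the Broad Street pump, shows a rate roughly ten times that of the workhouse, which had its own well. The spot map alone cannot show this.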

The making of a spot map such as this is commonly the first 
procedure in the investigation of a localized outbreak of disease. 
When the number of persons at risk in subsections of the affected 
area is large enough, the computation of rates is preferable to the 
spot map. 

At this juncture, it is necessary to point out that comparisons 
of disease rates for areas as small as the ones just described can 
only be expected to reveal those causal factors that have operated 
at a time close to the onset of the disease. Man is currently so 
mobile, particularly in the Western culture, that he occupies
during his lifetime a variety of living places. If, therefore,
comparisons of areas are expected to reveal causal factors operative at a
time distant from the onset of a disease, the use of the largest 
geographic units (nations) may be expected to be most revealing. 


INTERPRETATION 


The demonstration of an association of a disease with place im- 
plies one of the following: (1) The inhabitants of the particular 
place have characteristics that in themselves are of etiologic im- 
portance in the disease and that differ from those of the inhabit- 
ants of other places; (2) etiologic factors are present in the bio- 
logic, chemical and physical, or cultural environments of people 
inhabiting the affected places; or (3) both types of explanation 
are applicable. 


Characteristics of Inhabitants 


Characteristics of persons that produce associations with place 
include some of the clearly identifiable characteristics which have 
been discussed in the previous chapters. Age, for example, is so 
obvious that allowance for age differences between the popu- 
lations of different places forms part of the method of the com- 
parison and should not even enter into interpretative considera- 
tions. Mention has already been made of ethnic differences as 
determinants of geographic patterns. Religious groups also tend 
to be geographically clustered — for example, it is likely that the 
low death rates from lung cancer in the State of Utah can be at- 
tributed to the abstinence from tobacco required of the large 
Mormon population. 

On the other hand, the personal characteristics involved may 
be more subtle and not identifiable through use of gross de- 
scriptive variables. An example of genetic variation from place 
to place is the variation in frequency of sickle-cell anemia in 
Africa. Within East and West Africa, the sickle-cell trait* tends 
to be most frequent in areas in which malaria rates are high- 
est. Allison (1954) has shown that persons who are heterozygous 
for the sickling gene, although as frequently infected, are more 
resistant to the malaria parasite. It seems likely, therefore, that
genetic selection in favor of persons with the sickling trait is at
least partly responsible for the occurrence of high sickling rates in
districts with high malaria rates (Livingstone, 1958). On the
other hand, it is likely also that the present distribution of the
sickling gene accounts for some geographic variation in the
distribution of malaria as a disease manifestation, since
sickling-trait prevalences between 20 and 30 per cent are fairly
frequent among West African tribes.

* The sickle-cell trait occurs in persons heterozygous for the gene which in
homozygous form is associated with sickle-cell anemia.


Characteristics Peculiar to Place 


That a noted geographic association may be explained in terms 
of characteristics inherent in the place of occurrence of a disease 
is suggested by any of the following criteria: 

(1) High frequency rates are observed in all ethnic groups in- 
habiting the area. 

(2) High frequency rates are not observed in persons of simi- 
lar ethnic groups inhabiting other areas. 

(3) Healthy persons entering the area become ill with a 
similar frequency to the indigenous inhabitants. 

(4) Inhabitants who have left the area do not show high 
rates, and if affected at the time of emigration show signs of im- 
provement or recovery. 

(5) Other species than man inhabiting the same area show 
similar manifestations. 

The more of these criteria that a disease satisfies, the more
probably the frequency of the disease is in fact determined by a
characteristic (or characteristics) directly related to place. Although
all of them are rarely satisfied by a particular disease, they may
be; the classic example, endemic goiter, was clearly recognized as
a “place” disease long before the particular cause of its
distribution was determined. The fact that one or several of the criteria
are not satisfied does not rule out the possibility that strong 
etiologic agencies peculiar to the locality are operative in the 
genesis of a disease. For example, criterion (4) may be inappro- 
priate if the disease is one with a lengthy preclinical phase or is 
associated with progressive and irreversible pathologic change; or 
criteria (1) and (5) may not pertain in the presence of such an 
additional association as a genetic mechanism. 

It is worth noting parenthetically that differences in observed 
geographic distributions may lead to erroneous as well as to
enlightening conclusions when additional associations are present.
For example, an important argument used against the hypothesis 
of an etiologic similarity between syphilis and general paralysis 
of the insane was that general paralysis was quite rare in many 
countries where syphilis was very common (Kraepelin, 1913). 
The reason for this discrepancy, if it is a fact, is still unknown. 

If the evidence that a particular place or type of place exercises
an etiologic influence on a disease seems reasonably convincing, 
it is possible to seek for responsible mechanisms among several 
of the broad (and not mutually exclusive) features which make 
up our concept of “place.” 

Biologic environment. Best understood of the features of places 
that contain disease-determining mechanisms are the climatic and 
other ecologic characteristics that determine the animal and vege- 
table environment within which man exists. Such characteristics 
influence disease patterns by providing temperature, humidity 
and other conditions necessary for the survival of parasites outside 
the body (for example, the cercariae of schistosomiasis, the larval 
stages of some intestinal parasitic worms, and the various lepto- 
spira); by providing conditions suitable for the reproduction of 
animal vectors (for example, mosquitos, tsetse flies, ticks and 
various ectoparasites of man and other animals); and by
determining the size and kind of animal populations available for a
role either as host for intermediate stages of the parasites or as
reservoir of infection (for example, mollusks in schistosomiasis and
the mammalian reservoirs of plague, rabies, trypanosomiasis and
yellow fever).

The vegetable environment, in addition to its role in the deter- 
mination of the animal environment, may have a more direct 
part, as illustrated by the geographic distribution of such prosaic 
diseases as poison ivy dermatitis and such exotic ones as ergotism. 

Chemical and physical environments. Chemical conditions pe- 
culiar to certain places are capable of influencing the health of 
man through at least two known mechanisms — peculiarities in 
the chemical composition of drinking water and of inspired air. 
Perhaps the most geographic of all diseases — endemic goiter — 
has been shown to be associated with iodine lack in the soil. 
Sway-back in sheep is also the result of a deficiency, in this instance 
of copper. The inhalation of toxic chemicals peculiar to certain
localities was well illustrated by the effects of the London fog
shown in Figure 1 (see page 6). It seems likely, though not fully
established, that atmospheric pollution also has more gradually 
apparent effects on human health, including a probable role in 
the etiology of lung cancer. Such an association may well explain 
the unenviable position of Great Britain with respect to death 
rates from lung cancer. 

Physical environment plays an obvious role in diseases and 
disabilities directly connected with exposure to heat, cold or lack 
of oxygen (altitude). Recently, there has been interest in the 
identification of possible variations in rates of diseases known to 
be associated with ionizing radiation, for example, leukemia and 
bone cancer, according to variation in the levels of background 
radioactivity that occur from place to place. Gentry et al. (1959)
have reported evidence of such a correlation in a study of rates of 
congenital malformation in various areas of New York State. 

Cultural environment. Social arrangements peculiar to specified 
places are major determinants of the biologic, chemical and phys- 
ical environments to which the inhabitants are exposed. These 
arrangements appear to be the main forces in control of the na- 
ture and quantity of animal and vegetable species sharing the 
environment, of the frequency and diversity of interpersonal con- 
tact, of the presence of reservoirs and vectors for spread of in- 
fection, and of the degree and kind of physical and chemical ad- 
ditions to the air and water. 

Social conditions determining preferences, availability and dis- 
tribution of food are known to be responsible for geographic 
patterns in the distribution of such disorders as beriberi, pellagra 
and obesity. In addition, the less tangible factor of social
“climate” peculiar to a time and place may be productive of
manifestations that members of other social systems may view as
disease, although unfortunately the evidence so far remains largely
anecdotal. Hirsch (1883), for example, noted that: “Methodist 
evangelizing . . . spread like a pestilence from Ohio through 
Tennessee, Virginia, Kentucky and the Carolinas, affecting hun- 
dreds of thousands; the victims were known by various names, 
according to the form their enthusiasm took . . .” And Panum, 
during his well-known investigation of an outbreak of measles on 
the Faroe Islands in 1846, having taken the opportunity to
observe many other facets of life in the region, formed the opinion
that both the physical characteristics of the islands and the
religious attitudes of the people led to a type of mental disorder
which was distinct from disorders he had seen in other areas.

In descriptive epidemiology there is nowhere a more obvious 
need for the development of quantitative measures than in the 
area of social arrangements and relationships. 


CHANGES IN PLACE 


Man’s increasing mobility was mentioned earlier in this chap- 
ter in connection with an epidemiologic difficulty which it en- 
gendered. However, this mobility on occasion has offered situa- 
tions useful in the understanding of disease frequency as it relates 
to place of occurrence. 

Change in place of residence and life has been used as an 
epidemiologic variable in the exploration of two general types of 
hypothesis: (1) the existing disease level of a population is af- 
fected by the entrance into it of ill (or potentially ill) persons; 
and (2) the health of those entering is affected by the act of 
migration. 

The first hypothesis has stimulated a variety of interests, rang- 
ing from the role of the mass pilgrimages to Mecca in the spread 
of cholera to studies of the influence of introducing infected ani- 
mals into laboratory populations of mice. 

The second general hypothesis, that the act of migration
influences the health of migrants, is given support not only by
evidence derived from the behavior of infectious diseases, but also
by that relevant to diseases which do not seem to be associated with
microbiologic agents. In the instance of infectious diseases,
Wheelis’ work (1938), demonstrating a relationship between recent
entrance into the United States Navy and high rates of communicable
diseases, is particularly pertinent. For the “noninfectious”
diseases well-documented support of the hypothesis is found in the 
area of mental disorders. Psychotic disorders appear to be more 
frequent among migrant groups. 

This finding can be demonstrated in more than one way. The
comparison of rates of nativeborn with those of foreignborn
persons has been mentioned in Chapter 7. Another way, involving
comparison between rates for whites and non-whites, requires the
additional knowledge that migration within the United States
has increased at a more rapid rate for the non-white than for the
white population (Figure 25). Still another involves recency of
movement. A question in the 1940 census asked “Where were you
5 years ago (April 1, 1935)?” This denominator enabled Malzberg
and Lee (1956) to derive rates of psychosis among persons
known to have made a recent (5 year) move for comparison with
rates among those not known to have made a recent move. The
findings are summarized in Table 25.

Figure 25. Ratios of the number of persons living in different regions from
their region of birth to the number living in the region of birth, United States,
nativeborn population, 1890 to 1950. (Data from U. S. Bureau of the Census,
1949, 1953a.)


11


Time 


The omnipresence of time renders impractical any extended 
attempt to consider it as a separable item of experience or reality. 
Time has been noted as an ingredient necessary to each of the 
various epidemiologic measurements and to the very concept of 
causal association. However, in a more limited context, considera- 
tion is given here to the frequent use of time measured in certain 
arbitrary units as a general localizing variable. 


TIME IN VARIOUS LENGTHS 


Time may be thought of in whatever size we care to — an in- 
stant or an eon. When marked changes in disease frequency are 
noted over a period of time judged very short in the compara- 
tive experience of the observer, explanatory hypotheses seem eas- 
ier to propose (and are more readily accepted) than when the 
changes evolve more slowly. 


“Point” Epidemics 


Large and fleeting excesses in disease frequency are commonly 
called “point” epidemics. The same episode of cholera for which
localization by place was shown in Figure 24 (page 154) is now 
illustrated in Figure 26 according to its occupancy in time. The 
localization by time is seen also to have been very marked. 
Such outbreaks of disease are particularly helpful in revealing
etiologic factors, since the pattern (place and time) suggests that 
a large number of people were exposed to a similar causal cir- 
cumstance almost simultaneously, and that the mean interval be- 
tween exposure and onset was short. The latter supposition is 
based on the knowledge that some dispersion around the mean 
induction or incubation period between exposure to disease-in- 
citing events and onset of disease is always present. For such dis- 
persion to be encompassed within a few days indicates that the 
mean must itself be quite short. Both suppositions make search 
for the causal circumstance less difficult. 

Marked rises in disease frequency occupying brief lengths of 
time have regularly been found associated with very clearly de- 
lineated circumstances such as abrupt encounters (via air, food or 
water) with microbiologic organisms or chemical agents. The rise 
and fall of epidemics is necessarily slower when engendered by 
more gradually introduced or less circumscribed forces. If lengths 
of time are conceived of in larger units, however, fluctuations in 
disease frequency that appeared gradual when smaller time units 
were used now present an appearance in time no different from 
that of a “point” epidemic. The dearth of serious attempts at 
identifying such epidemics in chronic disease is due probably in 
part to failure to realize that time actually is not delivered in pre- 
determined lengths, and in part to the changing social scene that 
ensures that data with consistent definitions are seldom available 
over prolonged periods. 


Secular Change 


Secular change refers to changes that occur gradually over 
“long” periods of time, and in epidemiology usually implies 
changes in disease frequency measured over a period of several 
years or decades. 

Morbidity data adequate to measure such changes have to date 
been limited to certain geographic areas and have been restricted 
to notifications of occurrences of infectious diseases and of use of 
mental hospitals. Although valuable in the detection of truly 
gross and rapidly evolving changes, clinical impressions are, of
course, not dependable in detecting more subtle changes.
Consequently, death-certificate data have so far been a major source of
information. 

As measured by death certificate data, no disease has retained a 
constant frequency during the past 50 years. The investigation 
of such secular changes may be directed to two distinct questions: 

(1) Has there been a change in frequency of the disease 
that is really of epidemiologic interest, or is the change due to 
variation in ascertainment, reporting, case fatality, or some other 
complicating circumstance? 

(2) If the trend is of real epidemiologic interest, to what 
mechanisms may it be attributed? 

Changing ascertainment encompasses a number of distinct pos- 
sibilities. Changes due to altered procedure in assigning causes of 
death have already been discussed (see Chapter 6); they are usu- 
ally readily identifiable because of their abruptness and coinci- 
dence with known procedural alteration. Change in case fatality 
rates is an obvious factor that has influence on secular trends in 
death rates. Inconstancy of the characteristics of the population 
must, of course, also be considered. Change in age distribution is 
an obvious and readily identifiable source of secular change; the 
independence of secular change and changes in age distribution 
can be ascertained by investigation of age-standardized or age- 
specific rates. Other changes such as altering proportions of various 
ethnic groups may not be so readily identifiable, or if identified or 
suspected, may not be so readily investigated because of absence of 
population or other data necessary for the computation of group- 
specific rates. 
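
Separating a secular change from a change in age distribution, as described above, can be sketched with direct age standardization. This is an illustration under stated assumptions: the two periods, the age groups and all counts are hypothetical, and the first period's population is arbitrarily taken as the standard.

```python
def crude_rate(deaths, population):
    """Total deaths divided by total population."""
    return sum(deaths.values()) / sum(population.values())


def directly_standardized_rate(deaths, population, standard_population):
    """Age-specific rates weighted by a fixed standard age distribution."""
    total_standard = sum(standard_population.values())
    return sum(
        (deaths[age] / population[age]) * standard_population[age] / total_standard
        for age in standard_population
    )


# Hypothetical data: age-specific rates are identical in both periods,
# but the second population is older.
pop_1 = {"under 65": 80_000, "65 and over": 20_000}
deaths_1 = {"under 65": 80, "65 and over": 200}
pop_2 = {"under 65": 60_000, "65 and over": 40_000}
deaths_2 = {"under 65": 60, "65 and over": 400}

for label, deaths, pop in [("period 1", deaths_1, pop_1), ("period 2", deaths_2, pop_2)]:
    crude = 1000 * crude_rate(deaths, pop)
    standardized = 1000 * directly_standardized_rate(deaths, pop, pop_1)
    print(f"{label}: crude {crude:.1f}, age-standardized {standardized:.1f} per 1,000")
```

Here the crude rate rises between the periods while the standardized rate does not, so the apparent trend is attributable entirely to aging of the population. The same comparison for ethnic or other groups requires group-specific denominators, which, as noted above, are often unavailable.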

The component of the over-all question of epidemiologic rele- 
vance that is the most vexing, because it is the least readily an- 
swered, relates to the role of changes in clinical diagnosis, con- 
cepts, and terminology. This problem is most difficult to attack 
because such changes evolve gradually and thus do not have the 
telltale abruptness of procedural changes; further, while it is 
frequently possible to state with confidence that diagnosis of a 
particular condition is “better” now than it was 20 years ago, it 
rarely is possible to estimate how much better and to what ex- 
tent the observed change in disease frequency may be attributed 
to the improvement. Any attempt to estimate how many cases of a 
particular disease diagnosed at the present time would have been
diagnosed with the knowledge and facilities available at some 
time in the past is usually unrewarding. 

Assistance, however, in forming more than a subjective opinion 
on the role of changes in diagnostic capacity as an influence on 
disease trends may be derived from a number of procedures: 

(1) Comparison of trends in the disease under investigation 
with trends in other diseases judged equally susceptible to added 
diagnostic facility. For example, in Figure 27, death rates from 
lung cancer in England and Wales during the period 1911 to 
1955 are compared with death rates from cancers of several other 
internal sites. While lung cancer is by no means the only site that 
shows an apparent increase, the data leave no doubt that some- 
thing has happened for lung cancer that has not happened for 
cancers of other internal sites, and if of a diagnostic nature, be- 
trays a past ignorance that was remarkably localized from an 
anatomic point of view. Such evidence can only be suggestive be-
cause such a localized oversight is not inconceivable, and also be-
cause it is rarely possible to find two disorders in which diagnos-
tic procedures have advanced pari passu.

(2) Comparison of the frequency with which the disorder is 
present at autopsies performed at different times during the ob- 
served secular change. Paradoxically, this procedure will be in- 
creasingly satisfactory the less fatal the disease under investigation; 
that is, the greater the probability that the patients died of some 
other cause. The number of patients with the disease under study 
that is observed will, then, be a measure of the prevalence of that 
disease, and may be related to the number of autopsies performed 
to derive a point prevalence rate (see Chapter 5, page 57). In 
a fatal disease, however, the measure will be more directly re- 
lated to incidence, but it will probably not be possible to convert 
the autopsy data to rates because of inability to define the popula- 
tion to which the deaths relate. The restriction of this method to 
institutions where a high proportion of deceased persons under- 
went autopsy examination in all compared periods reduces errors 
resulting from changing patterns in the selection of deaths for
autopsy. However, it does not eliminate the possible effect of 
changes in the types of patients entering the institution before 
death. 

Figure 27. Equivalent average annual death rates from selected sites of cancer in persons aged
50 to 74, England and Wales, 1911-1955. Rates shown are the averages of those for the five 5-year
age intervals between 50 and 74 given by Case and Pearson (see legend for Figure 18). Sites shown
are: (1) lung, (2) stomach, (3) intestine, excluding rectum, (4) pancreas, (5) prostate, (6) uterus,
(7) ovary. (Data from Case and Pearson, 1956.)

(3) Examination of the extent to which the apparent trend
might be influenced by secular change in the proportion of cases 
misdiagnosed, as based on a variety of assumptions, with consid- 
eration of the reasonableness of the assumptions necessary to 
eliminate the trend. This method, although used infrequently, is 
illustrated by the examination by Gilliam (1955) of the possible 
effect on the trend in lung cancer death rates had various propor- 
tions of patients whose deaths in previous years were certified as 
due to other respiratory diseases in fact had lung cancer. Be- 
cause the number of deaths attributed to other respiratory dis- 
eases was much larger than the number attributed to lung cancer, 
particularly in the earlier years of this century, a quite small pro- 
portion of the other respiratory disease deaths, if they had in 
actuality been lung cancer deaths, would have made a substantial 
addition to the number certified as due to lung cancer, and there- 
fore a consequent substantial reduction in the apparent trend. 
However, the size of the assumed proportion of misdiagnosed 
cases necessary to eliminate the trend was so large that the as- 
sumption did not seem reasonable when considered in conjunc- 
tion with such autopsy material as was available in the literature. 
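
The arithmetic behind this kind of examination can be sketched as follows. This is an illustration of the reasoning only, not Gilliam's actual computation: the death counts are hypothetical, and the sketch assumes a constant population and no misdiagnosis in the later period.

```python
def adjusted_trend_ratio(lc_early, other_early, lc_late, fraction_misdiagnosed):
    """Ratio of later to earlier lung cancer deaths after reassigning an
    assumed fraction of early 'other respiratory' deaths to lung cancer."""
    corrected_early = lc_early + fraction_misdiagnosed * other_early
    return lc_late / corrected_early


def fraction_needed_to_eliminate_trend(lc_early, other_early, lc_late):
    """Fraction of early 'other respiratory' deaths that would have to be
    lung cancer for the corrected early count to equal the later count."""
    return (lc_late - lc_early) / other_early


# Hypothetical counts (not taken from any real series).
LC_EARLY = 1_000      # deaths certified as lung cancer, earlier period
OTHER_EARLY = 50_000  # deaths certified as other respiratory disease
LC_LATE = 10_000      # deaths certified as lung cancer, later period

apparent = adjusted_trend_ratio(LC_EARLY, OTHER_EARLY, LC_LATE, 0.0)
with_2_percent = adjusted_trend_ratio(LC_EARLY, OTHER_EARLY, LC_LATE, 0.02)
needed = fraction_needed_to_eliminate_trend(LC_EARLY, OTHER_EARLY, LC_LATE)
```

With these invented figures, reassigning only 2 per cent of the other respiratory deaths halves the apparent tenfold increase, yet 18 per cent would have to be reassigned to remove it altogether. Whether a proportion of that size is reasonable is then a matter for judgment against autopsy evidence.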

Lastly, there are a number of general arguments that can be 
applied to a discussion of the epidemiologic meaning of an ap- 
parent secular trend. Thus, trends that are markedly different 
for different age, sex or ethnic groups living in the same commu- 
nity would seem less likely to be artifacts than if they were not so 
localized within special subgroups within the community. Also it 
may be illuminating to ask, for example, in the case of lung 
cancer, whether diagnostic ability has become more sophisticated 
in recent years (say, between 1950 and 1959) at the same rate as 
it did during the earlier years when such diagnostic aids as X-ray 
and bronchoscopy were being developed. In lung cancer the up- 
ward trend has shown no evidence of recent leveling off. 

While there can be no doubt that lung cancer has undergone a 
real and substantial increase during the last three decades, it must 
be noted that secular trends of this magnitude are unusual in the 
noninfectious diseases. Much more frequently, as in the apparent 
increases in death rates from coronary heart disease, leukemia, 
and pancreatic and ovarian cancer, and in the increase in the 
trend of patients admitted to mental hospitals, the issue whether
an observed secular change reflects a change in disease frequency 
remains in doubt in spite of use of all the procedures and argu- 
ments mentioned. Although this uncertainty regarding the actual 
mechanism of the secular change must be given consideration in 
the formulation of hypotheses of etiology, outright denial of the 
possibility that the change may be of real etiologic significance is 
illogical. 


Interpretations 


The kind and magnitude of social, biological, physical and 
chemical circumstances that change with the passage of time are as 
infinite as time itself. An attempt at their enumeration would 
be purposeless. Discussion of general circumstances related to the 
occurrence of “place” diseases (see Chapter 10) is equally relevant
in a consideration of the role of time, but is not repeated here. 

We might note, however, that certain interpretations are 
more easily eliminated from consideration in place associations 
than in time associations. These are the ones discussed under 
changing ascertainment (page 142), since the conditions of vari- 
ous periods of time cannot be subjected to the special surveys that 
can be conducted to test a hypothesis of association with place. On 
the other hand, interpretations involving genetic differences in 
populations are usually more easily eliminated as productive of 
time associations. Associations with time occurring within rela- 
tively short periods such as decades can only be attributed to ge- 
netic changes if there have been marked population shifts. And 
such shifts will usually be identifiable. 

It has been repeatedly noted that changes occurring over a rela- 
tively short time may be more productive of hypotheses than 
those occurring gradually over decades. This is true even when 
the changes do not have the abruptness of a “point” epidemic.
An example, given in Chapter 4, is Snow’s observation of a 
change in the incidence of cholera in the customers of the Lam- 
beth Company over a 5 year period. The importance of the obser- 
vation of such a relatively rapid change lies in the restriction of 
the period that must be examined for changing conditions pos- 
sibly productive of the observed change in disease frequency. A 
more recent example is illustrated in Figure 28, which shows a 
striking and progressive increase between 1941 and 1945 in the 
rates of first admission to mental hospitals in the United States
for males between the ages of 15 and 34, the greatest increase be- 
ing for the age group 20 to 24. The pattern suggests that the 
stresses of military life may have an etiologic influence in mental 
disorder, at least among persons who under ordinary circum- 
stances would not have elected to enter the service. An alternative 
hypothesis — that induction into the service acted as a case-finding 
mechanism for this age and sex group, and hence, led to increased 
hospitalization rates — no doubt plays some part, but is unlikely 
to be a complete explanation since the trend persists until 1945, 
while the number of persons being inducted and examined de- 
clined markedly after 1943. 


Cohort Analysis of Secular Change 


The principle of cohort analysis as described in the discussion 
of association with age (see Chapter 7) also has application to the 
study of time trends. Thus, in the conventional method of show- 
ing trends in age-specific rates over time, as illustrated in Figure 
29, the abscissa of the graph is the year of death or occurrence of 
the event being measured. Therefore, the age-specific rates shown 
at any point along the abscissa are based on persons belonging to 
different cohorts. We see in Figure 29 that, in the oldest age 
groups, death rates for buccal cancer increased during the early 
years of the period and decreased in the later years, producing a 
peak in the trend about the middle of the period. There is a 
tendency for this peak to appear successively earlier in time as 
successively younger age groups are examined, until, among per- 
sons younger than 50, rates show a downward trend during the 
whole period of observation. 

The question might be asked whether this pattern can be ex- 
plained by the peak rates being characteristic of a particular 
generation of males and whether the later appearance of the peak 
as age advances can be explained by this generation occupying 
successively older age groups in successively later periods of time. 
In other words, are the secular changes noted for buccal cancer 
characteristic of the experience of cohorts, rather than a result of 
differential changes in the environment of persons of different 
age with the passage of time? This question may be examined by 
plotting the data with year of birth, rather than year of occur-
rence, along the abscissa, as in Figure 30. A trend noted between
any two points on the abscissa now reveals differences between
successive cohorts rather than between successive time periods.
We now see that, in contrast to Figure 29, the patterns exhibited
by the various age groups are similar. At all ages, death rates in-
creased with each successive cohort and reached a peak in per-
sons born around 1865; after 1865 each successive cohort born has
experienced lower death rates from buccal cancer, at all ages,
than did its predecessor.

Figure 29. Age-specific mean annual death rates from cancer of the buccal
cavity and pharynx, England and Wales, 1911-1954. (From MacMahon and
Terry, 1958.)

Figure 30. Data from Figure 29 arranged to show date of birth of the popu-
lations for which rates are expressed. (Ordinate: death rate per million;
abscissa: year of birth, 1835 to 1895.)


As noted in Chapter 7, the decision whether a cross-sectional or 
a cohort approach is appropriate to a particular situation depends 
on (1) a priori reasoning based on knowledge of the natural 
history of the disease in question, and (2) evidence of a greater 
consistency found by one or the other method. It was also noted 
that many diseases, such as the degenerative diseases, do not show 
any peak in their age curves that allows examination for a greater 
consistency by one or the other method. The method illustrated 
in Figure 29 and Figure 30 offers another possibility in such situa- 
tions, provided that during the period of observation rates for 
the disease examined have undergone a change in direction so 
that a peak (or a trough) in the time series is produced. It can 
then be determined whether the peak, or the trough, occurred
consistently (that is, at all ages) in a particular period or in a
particular cohort. Figure 30 suggests that for buccal cancer the
changes are characteristic of cohorts, and patterns in a number
of other neoplastic disorders are also compatible with that hy-
pothesis (MacMahon and Terry, 1957). Several other neoplastic
disorders and other degenerative diseases do not allow application 
of this method because trends have been consistently in one di- 
rection during the period for which frequency data have been 
available. 
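
The rearrangement of period data into cohort form can be sketched in a few lines. The example below is an illustration under stated assumptions, not the procedure used for Figures 29 and 30: the rates are synthetic, constructed so that the peak belongs to the cohort born in 1865, and the function names are hypothetical.

```python
def birth_year(period_mid, age_mid):
    """Approximate year of birth of persons of a given age in a given period."""
    return period_mid - age_mid


def peak_by_age(rates, axis):
    """For each age group, return the axis value (period or cohort) at
    which the rate is highest. `rates` maps (period_mid, age_mid) -> rate."""
    best = {}
    for (period, age), rate in rates.items():
        if age not in best or rate > best[age][1]:
            best[age] = (axis(period, age), rate)
    return {age: value for age, (value, _) in best.items()}


# Synthetic rates: an age effect multiplied by a cohort effect that is
# highest for persons born in 1865 (an assumption made for illustration).
AGE_EFFECT = {50: 1.0, 60: 2.0, 70: 3.0}
PERIODS = [1915, 1925, 1935, 1945, 1955]
rates = {
    (period, age): AGE_EFFECT[age] * (100 - abs(birth_year(period, age) - 1865))
    for period in PERIODS
    for age in AGE_EFFECT
}

peaks_by_period = peak_by_age(rates, lambda period, age: period)
peaks_by_cohort = peak_by_age(rates, birth_year)
```

Plotted against year of occurrence, the peak in these synthetic data falls in a different period for each age group, earliest in the youngest; plotted against year of birth, it falls in the same cohort at every age. Real data would seldom be so tidy, and the comparison is one of relative consistency.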

The general implications of the observed patterns of change 
that are characteristic of cohorts so far as hypotheses of etiology 
are concerned have been discussed in Chapter 7. There are two 
other considerations that are particularly relevant in connection 
with trends in age-specific rates over time. First, changes occur- 
ring in a cohort pattern are unlikely to be explicable in terms of 
changes in effectiveness of therapy, diagnosis or reporting, and 
must be sought in changes in etiologic factors. It is of course pos- 
sible that, for example, effectiveness of therapy might vary with 
age and produce trends that were not consistent in different 
age groups. But it would be a strange coincidence if the change in 
effectiveness of therapy became applicable to successively older 
5 year age groups in successive 5 year periods, as would be neces- 
sary to explain a cohort pattern. 

The second consideration relates to methods of prediction of 
time trends. 


Prediction of Time Trends 


It is sometimes useful for administrative purposes to be able to 
extrapolate time trends beyond the present available data. In this 
context the distinction between current and cohort patterns is 
important. 

For example, age-specific death rates from breast cancer in fe- 
males in England and Wales between 1915 and 1954 are shown 
in Table 26. The crude total rate (all ages) shows an increase in 
death rate over time. However, this must be largely attributed 
to a change in the age composition of the population, since in 
most age groups the trend has been downward since the period 
1935-39 and the total age-standardized rate decreases consistently 
since the years 1930-34. On this basis, therefore, we might predict
that age-standardized death rates from breast cancer will con-
tinue to decline.

However, examination of trends in the individual age groups 
reveals at least two distinct patterns: 

(1) A peak occurring in the period 1935-1939 and a subse- 
quent decline are most striking in the older age groups and are 
not evident in women younger than 50 years. This peak is evi- 
dently characteristic of current time, rather than of a cohort, 
since it is seen in four of the six age groups over 50, all of which 
belong to different cohorts. (In the other two age groups over 50 
the pattern is similar but the peak occurs in the years 1930-1934.) 

(2) A tendency to increasing rates is present in the younger 
age groups during the latest periods. This upward swing does 
not occur in the same period in each age group, and is in fact 
suggestive of a cohort pattern. Thus the first upward trend oc- 
curred in the period 1950-54 in the age group 45-49, five years 
earlier in the age group 40-44 and five years earlier still in the 
age group 35-39. The pattern suggests that, at all ages attained to 
date, women born around 1900 had the lowest breast cancer death 
rates, and that rates have increased for successive cohorts born 
thereafter. 

On the basis of this latter pattern we might therefore predict 
that as women born after 1900 reach those age groups in which 
breast cancer rates are highest, breast cancer death rates will in 
fact increase. This prediction is, of course, based on the assump- 
tion of a continuation in the cohort pattern. The assumption 
would be made meaningless by the subsequent occurrence of, 
for example, a marked improvement in therapeutic efficacy, 
which would be expected to be characteristic of a time period 
rather than of a cohort. 

With respect to the forecasting of trends in general death 
rates, there has been considerable controversy among demogra- 
phers as to whether cohort or current methods should be used 
(Pollard, 1949). The problem seems to us an empiric rather than 
theoretical one, and can only be decided by examination of past 
data to determine whether cohort or current patterns are occur- 
ring. Actually, in the context of so complicated a mechanism as 
the determinants of total death rates it is quite certain that mech-
anisms following both types of patterns are operative. In the pre-
diction of trends in individual diseases more simple patterns may
be evident and form the basis for predictions which will come 
closer to the mark. 


CYCLIC FLUCTUATIONS 


If this text were concerned with the content of epidemiologic 
knowledge rather than with epidemiologic methods, it would be 
necessary to devote considerable space to periodic variations in 
disease frequency. For example, seasonal fluctuations are among 
the most striking features in the epidemiology of the infectious 
diseases and have received a great deal of attention. 

However, the methods involved are comparatively simple. 
Cases are plotted by time of onset according to a conventional 
time unit. In detection of seasonal fluctuations, months or quarter 
years are generally used. In order to detect periodicity of any 
type it is necessary to use time units which are some fixed fraction 
of the interval between two successive fluctuations. Use of a refer- 
ence population and the calculation of a series of rates are un- 
necessary refinements, unless periodic changes in the size of the 
population are suspected. 
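
As a small illustration of the tabulation just described, the sketch below counts cases by calendar month of onset. The dates are hypothetical, the function name is an assumption, and no reference population is used, in keeping with the remark that rates are an unnecessary refinement unless the population itself varies periodically.

```python
from collections import Counter
from datetime import date


def cases_by_month(onset_dates):
    """Number of cases with onset in each calendar month, January first.
    Cases from different years are pooled by month."""
    counts = Counter(d.month for d in onset_dates)
    return [counts.get(month, 0) for month in range(1, 13)]


# Hypothetical dates of onset spanning two years.
onsets = [
    date(1958, 1, 5),
    date(1958, 1, 20),
    date(1958, 2, 11),
    date(1958, 7, 4),
    date(1959, 1, 9),
    date(1959, 12, 30),
]

monthly_counts = cases_by_month(onsets)
```

Months are used here because the suspected cycle is annual; a cycle of a different length would call for a time unit that is a fixed fraction of that cycle.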

Unlike the acute diseases, chronic diseases have received little 
attention with respect to possible periodicity of occurrence. This
is probably the result both of difficulty in defining date of onset 
in chronic diseases and of belief that etiologic factors in chronic 
diseases are operative through such prolonged induction periods 
that correlation with some other periodic variation (such as sea- 
son) would be unlikely. 


Interpretations of Seasonal Variations 


Methods of utilizing knowledge of seasonal periodicity to for- 
mulate hypotheses center around the process of correlating sea- 
sonal variation in disease frequency with seasonal variation in 
other circumstances of possible significance. For example, many 
studies have related disease occurrence to weather conditions 
(and hence to season). Figure 31 shows monthly deaths and 
monthly normal temperatures for three cities in the United 
States. As noted by many writers, the inverse correlation is strik-
ing. However, it should be noted that although seasonal
fluctuations in mortality are as great in Los Angeles as in New
York City, the seasonal temperature fluctuations are smaller in
Los Angeles than in New York City. Similarly, the temperature
fluctuations are greater but the mortality fluctuations less in Cin-
cinnati than in Los Angeles. These inconsistencies suggest that,
in spite of the inverse correlation, normal temperature variations
account for only a small part, if any, of the seasonal mortality pat-
tern. No greater success has been achieved in correlating other
indices of seasonal climate variation with mortality, although it
is likely that short periods of highly unusual climatic conditions,
such as heat waves, have an appreciable effect on mortality.

Figure 31. Seasonal variation in mortality and temperature in three cities.
The temperature curves are based on monthly normal temperatures for 1921-
1950; mortality curves are based on monthly totals of deaths, 1949-1958, com-
puted on the basis of daily averages. (Reproduced, with permission, from
Kutschenreuter, 1959.)

Seasonal variation in the flora and fauna of the human environ- 
ment has a profound effect on many human diseases and has been 
the area in which greatest use has been made of periodic variation 
in the development of hypotheses of causation. The resulting 
investigations have ranged from the identification of pollens con- 
cerned in hay fever to the search for insect vectors whose seasonal 
reproductive and feeding patterns are such as to explain seasonal 
patterns in disease frequency. For example, diseases contracted 
from ticks, such as Rocky Mountain spotted fever, show marked 
seasonal localization in the spring and early summer, correspond-
ing to the period of feeding activity of the adult ticks. The mos-
quito-borne encephalitides, on the other hand, are localized in the
late summer and early fall.

Seasonal fluctuations in human activities may also account for 
variation in exposure to sources of infection. Thus, the marked 
concentration of human leptospirosis during the summer months 
can be accounted for by increased exposure to infected waters 
through bathing and fishing. The crowding of persons together 
during the cold weather leads to conditions conducive to the 
spread of epidemic typhus. 

Seasonal variation in the acute infectious diseases of childhood 
constitutes one of the major unsolved problems of infectious dis- 
ease epidemiology. In spite of the consistency and striking mag- 
nitude of this variation, it remains unexplained; yet if, for 
example, the factors responsible for the seasonal variation in polio- 
myelitis were known and lent themselves to practical alteration 
for preventive purposes, the disease would cease to be a problem. 
Simpson (1952) showed that there was no seasonal variation in 
the secondary attack rates of measles, chickenpox and mumps 
once an infection had been introduced into a household. Seasonal 
variation in virus communicability does not therefore seem a 
likely explanation of the seasonal variation in these diseases.

Incidentally, seasonal circumstances that are entirely man made 
may play a part in seasonal disease patterns. This is illustrated by
the study of Friedman et al. (1958) which showed rises in serum 
cholesterol levels of accountants corresponding to critical dates in 
the tax calendar. If this is a real effect, it is unlikely that it is 
limited to accountants. 

For diseases in which environmental influences acting during 
intrauterine or early postnatal life may be of etiologic signifi- 
cance, season of birth is an informative descriptive variable which 
may point to the existence of such influences and give hints as to 
their nature. Probably the most clearly substantiated relationship 
of this type is that between intelligence and season of birth. Chil- 
dren born in the winter months have a significantly lower mean 
score on intelligence tests than those born in the summer months 
(Pintner and Forlano, 1943). In addition, risk of admission to a 
state school for the mentally retarded was shown by Knobloch 
and Pasamanick (1958) to be highest for children born in Febru- 
ary and lowest for those born in August. A number of hypotheses 
have been developed as a result of this observation, one of which 
relates the mental retardation to unidentified nutritional dis- 
turbances in early pregnancy, these disturbances being caused 
by changes in maternal diet during the hot summer months. In 
an interesting investigation of this hypothesis, Knobloch and 
Pasamanick compared admission rates for mental retardation for 
children whose critical stage of intrauterine development (in this 
case, supposed to be the third month) occurred during summers 
in which the temperature was high with those born following 
summers in which the temperature was low. Admission rates 
were significantly higher for the former group than for the lat- 
ter. This is an interesting combination of season and secular vari- 
ation which deserves further elucidation. 


CLUSTERING IN TIME 


The descriptive studies discussed so far have all been based on 
the comparison of rates that involve knowledge of reference popu-
lations. As noted in the discussion of spot maps, “point” epidem-
ics and periodicity, comparisons are sometimes made on the basis
of numbers of cases, but knowledge of the population is implicit.
Thus, in the case of an epidemic such as that of the London fog, 
it is assumed that the population base remained the same 
throughout; thus, the principle of comparison of two or more 
populations with respect to disease frequency still holds. 

However, it occasionally happens that hypotheses of etiology 
are derived from the clustering of certain events along the time 
axis in the histories of affected individuals, even though no in- 
ferences regarding the population at risk are made and no com- 
parisons with other groups are involved. This time clustering is 
best seen in diseases in which the interval between precipitating 
event and onset of illness can be measured with precision. 

For example, Aycock and Luther (1929) obtained histories of 
tonsillectomy from a series of patients with poliomyelitis. They
noted that among 36 cases of poliomyelitis that occurred within
12 months of tonsillectomy there were 16 in which poliomyelitis
followed within 1 month of the operation. All these 16 cases oc-
curred within 7 to 18 days of tonsillectomy, a period the authors
considered compatible with the incubation period of poliomyeli- 
tis. Even though no data were available to determine how many 
children selected at random might have given a history of ton- 
sillectomy within 7 to 18 days, the marked clustering of ton- 
sillectomies strongly suggests a causal relationship between tonsil- 
lectomy and poliomyelitis in at least some of the cases. 

Similarly, data of Tetlow (1955) on the distribution by month 
of 67 patients in whom mental disorder was associated with preg- 
nancy or the puerperium show a marked clustering of cases with 
onset of symptoms in the 4 weeks after delivery (Figure 32). 
Within the group with onset in the 4 weeks after delivery there 
was additional clustering, nearly half of them becoming ill in the 
week immediately following delivery. Such a clustering in time 
indicates the probable presence of psychotogenic factors opera- 
tive around the time of delivery of a child at least among some of 
the women observed. 

A more statistically sophisticated variation on this theme is the 
measurement of the interval between the suspected event and on- 
set of the disease in a series of cases, and the determination of the 
mean and variance of this interval. The variance of the interval 
should be smaller if the measurement is taken from the date of
occurrence of a causally related event than a variance derived
from a mean of points in time that have a less direct causal con-
nection to the disease. This method was used in an investigation
of the possible relationship of hepatitis among service personnel
to yellow fever vaccination (Sawyer et al., 1944). It was impos-
sible to compare affected and unaffected individuals with respect
to frequency of yellow fever vaccination, since practically all the
military personnel under observation had been vaccinated. How-
ever, investigation of the time relationship showed that the inter-
val between vaccination and onset of the disease was less variable
than the date of onset. The mean date of onset for the 5917 Cali-
fornia cases was May 31.9, and its standard deviation 5.7 weeks,
while the mean interval between vaccination and onset of jaun-
dice was 14.6 weeks and its standard deviation 3.5 weeks (Figure
33).

Figure 32. Distribution by time of onset of symptoms of 67 patients with
mental disorder associated with pregnancy and the puerperium. (From Tet-
low, 1955.)

Figure 33. Distribution according to week of onset and according to interval
between yellow fever vaccination and onset of the disease for 5917 cases of
jaundice, U.S. military units in California, 1942. (From Sawyer et al., 1944.)


With respect to clustering of events along the time axis in the 
histories of affected individuals, it should be noted that: 

(1) For clustering to be observed with greatest clarity the 
interval between effect and presumed cause should be measured 
with as much precision as possible, in order to keep as small as 
possible that part of the variance derived from errors in measure-
ment.

(2) Clustering will be less apparent when the interval be-
tween effect and cause is very variable, and additionally so if
there is a consistent change in this interval during the course of 
an epidemic. For example, if the interval between exposure and 
onset decreases as an epidemic progresses (as may be the case in 
an outbreak of staphylococcal food poisoning), it is possible 
that the actual dates or times of onset of cases may be more 
closely grouped about their mean than the intervals between ex- 
posure and onset are about theirs. Consequently, the fact that the 
variance of the interval dated from a certain event is not smaller 
than the variance of, for example, the date of onset does not nec- 
essarily indicate a lack of causal connection between the event and 
the disease. 

(3) While clustering in time does indicate that a certain
preceding event and the onset of a certain disease are associated 
in time, the same reservations about the causal nature of such an 
association must be made as are made about associations based on 
relative frequency of concordance of two events. 
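
The comparison of variability discussed in this section can be sketched as follows. The five case histories are hypothetical and the function name is an assumption; the sketch simply contrasts the standard deviation of the date of onset with that of the interval from a suspected event to onset.

```python
from datetime import date, timedelta
from statistics import stdev


def onset_and_interval_sd(cases):
    """Return (SD of date of onset, SD of event-to-onset interval), in days.
    `cases` is a list of (event_date, onset_date) pairs."""
    onset_days = [onset.toordinal() for _, onset in cases]
    intervals = [(onset - event).days for event, onset in cases]
    return stdev(onset_days), stdev(intervals)


# Hypothetical histories: suspected events three weeks apart, each
# followed by onset after roughly 100 days.
event_dates = [date(1942, 1, 5) + timedelta(days=21 * i) for i in range(5)]
interval_days = [100, 104, 98, 102, 96]
cases = [
    (event, event + timedelta(days=n))
    for event, n in zip(event_dates, interval_days)
]

sd_onset, sd_interval = onset_and_interval_sd(cases)
```

Here the interval is far less variable than the date of onset, as in the jaundice example. As point (2) warns, the converse finding would not by itself show the absence of a causal connection.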


12 
Birth Order and Parental Age 


In the investigation of certain categories of disease or defect 
there is especial interest in the identification of environmental 
influences operative during intrauterine life. This is particularly 
true of disorders manifest at birth or in infancy, for example, 
the congenital malformations, although it is equally conceivable 
that prenatal environmental characteristics may be causally asso- 
ciated with mental and physical attributes and defects that are 
manifested much later in life. 

Through astute clinical observation certain noxious agents 
operative during prenatal life have been brought to attention. 
For example, Gregg (1942) noted the occurrence of maternal 
rubella during a number of pregnancies productive of offspring 
who were congenitally blind from cataracts. Subsequent special 
studies confirmed Gregg’s suspicions as to the causal nature of 
this association. In similar fashion some of the undesirable effects 
of prenatal irradiation, maternal syphilis and toxoplasmosis have 
been identified. The epidemiologic methods used in the testing 
of these clinical suspicions are described in subsequent chapters. 

The specific agents identified to date account for only a minute 
fraction of the cases of disease and defect in which it seems logical 
to believe that prenatal environmental influences may play a 
part. The vast majority of cases of congenital malformation and 
mental retardation, for example, remain unexplained. In addi-
tion, there are large areas, for example, in the mental disorders,
in which it seems reasonable to look for evidence of prenatal 
influences even though the mechanisms by which such influences 
might be operative are not obvious. Conceivably other specific 
agents will be identified through direct clinical observation. Fur- 
ther, the animal-vegetable-or-mineral approach of descriptive epi- 
demiology is appropriate to such problems; this approach will 
help to determine whether it is likely that prenatal environ- 
mental influences of some nature are causally related to particu- 
lar manifestational disease categories and to localize investigative 
efforts within the potentially most profitable areas. 

In addition to the general descriptive variables that have been 
discussed in previous chapters, there are certain other variables 
peculiarly appropriate to this area of investigation. The two that
have the widest application are order of birth in the sibship and 
age of parents at the time of birth. These two variables are the 
subject of this chapter. It should perhaps be made clear that we 
are concerned here with the effects of parental age and birth 
order (or “birth rank”) on characteristics of the offspring, and not
with their effects on disorders of reproduction exhibited by the 
parents. 


METHODS 


Birth certificates, obstetric records of hospitals, and interviews 
are the common sources of information on the distribution of a 
series of patients according to birth order and maternal age. 

There are two general methods of examination: 

(1) Comparison of the birth order or parental age distribu- 
tion of the series of affected individuals with the population of 
births from which they were drawn, or with a representative 
sample of that population. 

(2) Comparison of the birth order or parental age distribu- 
tion of the series of patients with the expected distribution calcu- 
lated from the size and other characteristics of the sibships into 
which the patients were born. 

Comparison with the related population, or with a sample of it, 
is the method of choice in studies in which the interval between 
the birth of the series of patients and the date of the study is 
short. This requirement is most likely to be met in studies of
characteristics which are diagnosed at birth or shortly thereafter. 
The method is difficult to apply in studies undertaken many 
years after the birth of the affected individuals, such as in studies 
of diseases first appearing in adult life, since in these circum- 
stances it is usually difficult to identify and obtain information 
on the related population of births. 

Fortunately, the second method, described by Greenwood and 
Yule (1914), is particularly applicable to studies in which there 
has been a long interval between the birth of the affected individ- 
uals and the time of the study. The basic assumption in the 
Greenwood-Yule method is that an affected sibling is equally 
likely to be born at any of the birth ranks or parental ages repre- 
sented in the sibship to which he belongs. This assumption is 
valid only if the families under study are complete — that is, if 
the parents responsible for the sibship have completed their re- 
production. The Greenwood-Yule method is therefore particu- 
larly suitable for the examination of conditions which do not ap- 
pear until adult life; in such cases it may be safely assumed that 
by the time the characteristic appears the sibship into which an 
affected person was born will be complete. 


Definitions 


There will be little difficulty in defining parental age (the age 
of either or both of the parents at the time of birth of the index 
case). There are, however, several definitions of birth order, de- 
pending largely on the treatment of fetal losses and multiple 
pregnancies. For example, in calculating the number of previous 
pregnancies, fetal losses may be ignored, birth order of the pa- 
tient then being defined as the number of previous livebirths 
plus one. Alternatively, both liveborn children and fetal deaths 
may be counted, or, sometimes, only fetal deaths occurring be- 
yond a certain stage of pregnancy may be included with livebirths. 
Multiple pregnancies may be regarded as contributing either one 
previous pregnancy or two or more previous children. 
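
The alternative definitions can be set side by side in a short sketch. The function name, the argument names and the three labels are ours, introduced only for illustration; the treatment of multiple pregnancies is left out.

```python
def birth_order(prior_livebirths, prior_fetal_deaths=0, late_fetal_deaths=0,
                definition="livebirths"):
    """Birth order of the index child under one of three definitions.

    prior_fetal_deaths counts every earlier fetal loss; late_fetal_deaths
    counts only those occurring beyond the chosen stage of pregnancy.
    The labels below are illustrative, not standard terminology.
    """
    if definition == "livebirths":
        previous = prior_livebirths                      # fetal losses ignored
    elif definition == "all_pregnancies":
        previous = prior_livebirths + prior_fetal_deaths
    elif definition == "late_losses":
        previous = prior_livebirths + late_fetal_deaths
    else:
        raise ValueError(f"unknown definition: {definition}")
    return previous + 1


# A mother with two previous livebirths and two fetal losses, one of them late.
for name in ("livebirths", "all_pregnancies", "late_losses"):
    print(name, birth_order(2, 2, 1, definition=name))
```

The same child is third, fifth or fourth in birth order according to the definition adopted, which is why one definition must be applied to all compared groups.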

The choice of a definition of birth order may be limited by the 
information available. For example, most national vital statistics 
record merely the number of liveborn children prior to the birth
being reported. The choice may also be influenced by the accuracy
of the information available. Thus, while the use of the total
number of previous pregnancies has some appeal, it has also the
disadvantage that differences between groups in the success of 
efforts to obtain full histories of early fetal loss will be reflected in 
an apparent difference in birth rank. The choice of a definition 
may also be influenced by the nature of suspected etiologic fac- 
tors. For example, if it seems likely that the process of labor or 
factors associated with the number of children already in the 
household may have an etiologic influence, then it would seem 
reasonable to seek for differences between groups with respect to 
the number of previous liveborn children or of liveborn and sur- 
viving children. If, however, the mere fact of pregnancy is likely 
to have an etiologic influence on the condition, for example, if 
evidence of hormonal influence is being sought, then the fuller 
definition of birth order would seem more appropriate. When 
data are available, it would seem advisable to conduct several 
comparisons using different definitions. 

More important than the choice of a definition of birth order 
is the requirement that the same definition be applied to all 
compared groups. 


Comparison with the Related Population of Births 


Clearly, comparison of the distribution of a series of affected 
individuals with the related population of births is possible only 
if that population can be identified — for example, if the affected 
individuals comprise all the individuals with the particular char- 
acteristic identified in a specific geographic or administrative area. 
In such circumstances the collection of information on the re- 
lated population may be relatively easy. For instance, several 
countries, including the United States and Great Britain, publish 
annual statistics on the distribution of all births by birth rank and 
maternal age. For the United States, data on age of fathers are 
also available. Such data allow rates to be computed for any dis- 
order for which information on birth rank or parental age dis- 
tribution is also available on a nation-wide or state-wide basis. 
They have been used extensively in studies of fetal death and in- 
fant mortality rates, multiple pregnancy, and sex ratio at birth. 
An example, showing the relationship of fetal death rates to birth 
order in data for the United States, is given in Figure 34. 

Figure 34. Fetal death rates in whites and non-whites related to order of
birth, United States, 1957. Birth order is derived from the number of children
born to the particular mother, including fetal deaths. Data for this figure and
for Figure 36 are derived from Vital Statistics of the United States (National
Office of Vital Statistics, 1957). They do not include data for Massachusetts,
which does not require the reporting of birth order or birth weight.

Data on distribution by birth order and maternal age of a large
population of births are also occasionally available from obstetric 
records in hospitals where tabulation of this information is un- 
dertaken routinely. Material from obstetric records in hospitals 
has the advantage of detail, of diagnostic accuracy, and of ena- 
bling examination of many more disorders and characteristics 
than are recorded in vital statistics. An example of the use of 
such material, showing the relationship of the prevalence of 
mongolism to maternal age, is shown in Figure 35. In areas in 
which hospitalization for delivery is not routine and where pri- 
orities for hospital delivery may be established in part on the 
basis of age and parity, hospital material may be unsuitable.


[Figure: prevalence per 1,000 births plotted against age of mother at delivery.]

Figure 35. Prevalence of mongolism as diagnosed at birth related to maternal
age, ten London maternity hospitals, circa 1949. (From Carter and MacCarthy,
1951.)


Use of comparison samples. Comparison with the related popu- 
lation of births may not be possible either because the parental 
age and birth order distribution of the population is unknown, or
because the related population of births cannot be identified. 
Various forms of comparison samples may then be used. The 
methods of selection of such samples are similar to those used in 
the selection of comparison groups in a wide variety of case history
studies and are described at length in Chapter 14. Only a
few examples are given here. 

When the related population can be identified but parental 
age and birth order data are not available, the usual procedure 
is to select a random or systematic sample of the population of 
births. For example, a number of studies have utilized informa- 
tion derived from a sample of all births occurring in the admin- 
istrative area of the City of Birmingham, England. This sample, a 
1 in 200 sample of all births in the city during a certain period, 
was originally assembled from registers of births for comparison 
with a series of infants with malformations of the central nervous 
system (Record and McKeown, 1949). Subsequently, this sample 
has been used to investigate the age and parity associations of a 
wide variety of congenital malformations. In each study the se- 
ries of affected individuals consisted of all children identified as 
having the specified malformation who were born in the city 
during the same years covered by the sample of births. In this 
case, the required information on birth rank and maternal age 
was obtained by field inquiry from the mothers in the sample. 
Frequently, however, such information will be available from
the sampled records without the necessity of field inquiry. This 
will almost always be true, for example, if the sample is drawn 
from hospital records. 

If a field inquiry or some further procedure is necessary to 
obtain information after the sample has been selected, it is im- 
portant that equivalent effort be devoted to the completion of 
that information in both the patient series and in the comparison 
sample. Because of the greater mobility of young adults with 
small families, that proportion of the sample for whom the sub- 
sequent information is not obtained is likely to be over-repre- 
sented in the younger parental ages and in the low birth orders. 
Provided this bias affects the two compared groups equally it is 
not likely to cause serious difficulty. It would, however, be un- 
wise to compare a patient series in which birth order was known 
for all with a sample of the related population of births in which 
birth order was unknown for any appreciable proportion. 

As described in Chapter 14, some idea of the risks associated 
with particular birth ranks or maternal ages may be obtained by 
assuming that the distribution of the comparison sample represents
the distribution of the total population of births. An illustration,
using the previously described Birmingham sample, is
shown in Table 27. The estimate of the birth rank distribution 
of all livebirths is derived by applying the percentage distribu- 
tion of the comparison sample by birth rank to the known total 
number of livebirths (164,271). These estimates are then used 
as the denominators of incidence rates. 


TABLE 27


Birth rank distribution of all patients with infantile pyloric stenosis
born in Birmingham, England, 1943-1952 and of a 0.5 per cent
sample of the related population of births*


                Pyloric stenosis                           Estimated birth      Estimated
                     series         Comparison sample      rank distribution    incidence of
                                                           of all livebirths    pyloric stenosis
Birth                                                      (164,271 × previous  per 1000
rank            Number  Per cent    Number  Per cent       column)              livebirths

1                 244     51.0        308     36.1             59,318              4.1
2                 139     29.1        271     31.8             52,189              2.7
3                  57     11.9        123     14.4             23,688              2.4
4                  19      4.0         74      8.7             14,242              1.3
5+                 19      4.0         77      9.0             14,834              1.3
Total             478    100.0        853    100.0            164,271              2.9


* Data from McKeown et al. (1951).
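
The arithmetic behind the last two columns of Table 27 can be sketched as follows. This is an illustration only: the function name is ours, and the case counts for birth ranks 2 and 3 are taken as 139 and 57, the values implied by the series total of 478 and the 11.9 per cent entry for rank 3. The estimated numbers of births may differ slightly from the printed column through rounding.

```python
TOTAL_LIVEBIRTHS = 164_271          # known total of livebirths in the city

ranks = ["1", "2", "3", "4", "5+"]
cases = [244, 139, 57, 19, 19]      # pyloric stenosis series (ranks 2, 3 inferred)
sample = [308, 271, 123, 74, 77]    # comparison sample of births


def estimated_incidence(cases, sample, total_births):
    """Scale the sample distribution up to all births, then form rates per 1000."""
    n = sum(sample)
    est_births = [total_births * s / n for s in sample]
    rates = [1000 * c / b for c, b in zip(cases, est_births)]
    return est_births, rates


est_births, rates = estimated_incidence(cases, sample, TOTAL_LIVEBIRTHS)
for rank, births, rate in zip(ranks, est_births, rates):
    print(f"rank {rank:>2}: about {births:,.0f} births, {rate:.1f} per 1000")
```

The estimated denominators carry the sampling error of the comparison sample, so the rates for the sparsely represented birth ranks are correspondingly less stable.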


If the series of patients to be examined cannot be referred to 
a specific population of births, for example, in a series of patients 
with a particular disease assembled from the records of a pedia- 
tric hospital, greater difficulty will be encountered in the selection 
of a suitable comparison group. Some form of paired sample, as 
described in Chapter 14, will probably be most suitable, although 
the birth rank distribution of a series of births varies so much in 
relation to economic status of the mothers, place and time of oc- 
currence of the birth, racial and religious factors, and other vari- 
ables that it is very difficult to select a comparison group for a 
particular situation which does not differ from the patient group 
in a way that might affect its birth rank composition. 

For example, Still (1927) published a well-known paper on 
“place-in-family” as a factor in disease, noting that the percentage
of first-born children among 400 patients with pyloric stenosis 
admitted to a pediatric hospital was 48.5, compared with a 
“standard” of 18.5 per cent among patients with lobar and bron- 
chopneumonia, the latter being acute diseases that were consid- 
ered unlikely to be related to place-in-family. While Still’s con- 
clusion — first-born children are more likely to develop pyloric 
stenosis than are later-born children — was substantially correct 
(the magnitude of the association in this disease is so great that it 
is difficult to conceal), the very low percentage of first-born in 
the “standard” suggests that this was not a suitable comparison
group. This is confirmed by the fact that other groups of patients 
also showed a higher percentage of first-born than the standard 
but that, for several of these, later work did not indicate the 
presence of a birth rank association. The nature of the diseases 
that showed a birth rank distribution similar to the standard 
(tuberculosis, “digestive diseases”) suggests that the standard
may have been drawn from a lower economic group than the gen- 
eral run of the hospital’s patients. 


The Greenwood-Yule Method 


Birth order. The basic assumption in the Greenwood-Yule 
method is that if the risk of appearance of a characteristic is un- 
related to order of birth, then among completed sibships of size 
s containing n affected individuals there will be n/s affected in- 
dividuals in each birth rank. This supposition is used in the calcu- 
lation of an expected distribution by birth rank, for comparison 
with the observed distribution. The method may be illustrated 
by using the data shown in Table 28. The table is taken from a 
study by Norton (1952) of 2500 patients with neurosis. 

The information required for the calculation of the expected 
distribution by birth order is the total distribution by sibship 
size, shown in the right hand column of Table 28. The calculation,
in which the data for each sibship size are considered separately,
is shown in Table 29. The 226 affected individuals who had no
siblings must obviously all be first births. In a random distribu- 
tion, the 358 affected individuals who came from sibships of size 
two might be expected to be half (179) first births, and half 
(179) second births, and the 374 affected individuals in sibships 
of size three might be expected to represent the first, second and
third birth orders in equal proportions. Thus, the expected dis- 
tribution by birth order is calculated for each sibship size, and 
the total expected distribution is derived by summation (bottom 
row of Table 29).
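
The derivation of the expected distribution may be sketched in a few lines. The function name is ours, and only the three sibship sizes quoted in the text (226, 358 and 374 affected individuals in sibships of sizes one, two and three) are used; the full calculation would include every sibship size in the series.

```python
def expected_by_birth_order(affected_by_sibship_size):
    """Expected number of affected individuals at each birth rank.

    affected_by_sibship_size maps sibship size s to the number n of affected
    individuals found in sibships of that size; each of the s birth ranks in
    such sibships receives n/s, and the contributions are summed over sizes.
    """
    max_size = max(affected_by_sibship_size)
    expected = [0.0] * max_size
    for size, n_affected in affected_by_sibship_size.items():
        share = n_affected / size
        for rank in range(size):          # ranks 1..size are indexes 0..size-1
            expected[rank] += share
    return expected


affected = {1: 226, 2: 358, 3: 374}       # sizes quoted in the text only
expected = expected_by_birth_order(affected)
for rank, value in enumerate(expected, start=1):
    print(f"birth order {rank}: {value:.1f}")
```

Because each sibship size is handled separately, the expected total necessarily equals the number of affected individuals entered.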

When the expected distribution by birth order has been ob- 
tained, it may be compared with the observed distribution, as in 
Table 30. Note that, while birth orders may be grouped for the 
purpose of this comparison, as in this example into “6+,” they
cannot be so grouped during the derivation of the expected distribution,
since the latter calculation requires use of each individual
family size. The comparison of expected and observed distributions
in this example shows that there was a higher than expected
frequency of patients with neurosis in birth orders 5 and 6 and 
over, and a lower than expected frequency in birth orders 1, 2, 
and 3. 

TABLE 30


Comparison of total observed and expected distributions by birth order
of 2500 patients with neurosis*


Birth      Observed        Expected        Observed/
order      distribution    distribution    Expected

1             751             794.7          0.95
2             553.5           568.7          0.97
3             369             389.7          0.95
4             266.5           265.0          1.01
5             207             178.0          1.16
6+            353             304.5          1.16

Total        2500            2500.6


* Data from Tables 28 and 29.
χ² = 16.37, n = 5, p < 0.01
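
The ratios and the chi-square value attached to Table 30 can be checked with a short calculation. The sketch below is ours; it reads the entries for birth orders 1 to 3 as 751 and 794.7, 553.5 and 568.7, and 369 and 389.7, the readings consistent with the printed totals and ratios.

```python
orders = ["1", "2", "3", "4", "5", "6+"]
observed = [751, 553.5, 369, 266.5, 207, 353]
expected = [794.7, 568.7, 389.7, 265.0, 178.0, 304.5]

# Ratio of observed to expected frequency at each birth order.
ratios = [o / e for o, e in zip(observed, expected)]

# Chi-square: sum over birth orders of (O - E)^2 / E.
chi_square = sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# One degree of freedom is lost because the totals are constrained to agree.
degrees_of_freedom = len(orders) - 1

for order, ratio in zip(orders, ratios):
    print(f"birth order {order:>2}: observed/expected = {ratio:.2f}")
print(f"chi-square = {chi_square:.2f} on {degrees_of_freedom} degrees of freedom")
```

The two highest birth order groups contribute by far the largest part of the total chi-square.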


There are a number of minor points to be considered in these
calculations. The first is the treatment of twins. Probably
the most satisfactory procedure for dealing with twins is to count
them as contributing two units to the calculation of family size
and, if one of them is affected, to make their contribution to the
observed distribution one-half at each of two consecutive birth
ranks. Thus, a sibship of size three consisting of one older sibling
and a pair of twins, one of whom was affected, would be
scored as follows:

                Birth rank
              1      2      3      Total
Observed      0     1/2    1/2       1
Expected     1/3    1/3    1/3       1
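
The scoring of such a sibship can be sketched with exact fractions. The function names are ours and the sketch assumes a single affected individual, or a single affected twin, per sibship.

```python
from fractions import Fraction


def score_sibship(size, observed_weights):
    """Observed and expected contributions of one sibship, by birth rank.

    observed_weights maps a birth rank (1-based) to the weight scored there;
    the expected contribution is the total weight spread evenly over ranks.
    """
    observed = [Fraction(0)] * size
    for rank, weight in observed_weights.items():
        observed[rank - 1] += Fraction(weight)
    n_affected = sum(observed)
    expected = [n_affected / size] * size
    return observed, expected


def twin_weights(first_twin_rank):
    """One affected twin: one-half at each of the two consecutive ranks."""
    half = Fraction(1, 2)
    return {first_twin_rank: half, first_twin_rank + 1: half}


# One older sibling followed by a pair of twins, one of whom is affected.
observed, expected = score_sibship(3, twin_weights(2))
print("observed:", [str(x) for x in observed])
print("expected:", [str(x) for x in expected])
```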


The question of inclusion or exclusion of deceased siblings in 
the calculation of sibship size is of some importance. Neonatal 
and infant deaths occur more frequently in certain birth orders 
— particularly the first and the late orders. If the characteristic 
under consideration is one which is not manifest in early life, 
there will be a proportionately lower risk of being affected in 
these birth orders. Consequently, in the calculation of sibship 
size, only those individuals who lived long enough to have been 
at risk of exhibiting the characteristic should be included. 

Familial disorders. A number of other questions are raised by 
the application of the Greenwood-Yule method to diseases in 
which more than one member of a sibship is likely to be affected 
either because the disease is very common or because it shows 
a familial predisposition. It is important to note that, in the for- 
mula on page 195, n is not the number of sibships in which affected 
individuals occurred, but the number of affected individuals in 
those sibships. These two items are the same in the case of con- 
ditions that are uncommon and nonfamilial. 

When dealing with conditions showing high familial incidence, 
the error resulting from application of the Greenwood-Yule 
method to incomplete families will be greater than in nonfamilial 
conditions, since the probability of the early birth orders escap- 
ing and the family being ascertained through an affected individ- 
ual in the later birth ranks will be decreased. 

In a series of sibships, those persons occupying the early birth 
orders will be on average older than those in the later ranks. 
When dealing with a familial condition whose incidence increases 
with age, the possibility that incidence is higher in the early 
than in the later birth orders simply on this account should be 
considered. This consideration is not relevant if the probability 
of more than one affected individual occurring in a family is 
small. 

Incomplete sibships. It has already been stated that application 
of the Greenwood-Yule method requires that the sibships examined
be complete. Use of the method on a series of incomplete
sibships results, as a rule, in an over-representation of affected 
individuals in the higher birth orders and maternal ages, since 
data for the later-born members of large families will only be 
obtained for such families as have an affected individual in the 
later birth orders. Consequently, there is likely to be an incorrect 
suggestion of increased risk in the later birth ranks and maternal 
ages. The shorter the interval between birth and the time of the 
study, the greater will be the bias from this source. 

The extent and types of this error have been considered by 
McKeown and Record (1956), who list a number of studies in 
which erroneous conclusions have resulted. These authors also 
suggest a modification of the method for use when sibships are 
incomplete and data are not available on the related population 
of births. The method is suitable for use when cases have been assembled
during a limited period, and all siblings born within
that same period (either before or after the patients) are known. 
In this method, the expected distribution by birth order is de- 
rived in the same way as in the Greenwood-Yule method, except 
that siblings born outside the period during which the patients 
were assembled are excluded. Thus, if the patient series was assembled
from affected individuals born during the period 1950 to
1954, a family comprising siblings born in 1949, 1950, 1952 (the 
patient) and 1955 would be scored as follows: 


                Birth rank
              1      2      3      4      Total
Observed      0      0      1      0        1
Expected      0     1/2    1/2     0        1
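
The same scoring can be written out as a short sketch. The function name is ours; it assumes that the years of birth are listed in order of birth and that one patient is scored per family.

```python
from fractions import Fraction


def score_within_period(birth_years, patient_index, start, end):
    """Score one family, counting only siblings born within the period.

    Birth ranks keep their positions in the whole sibship; siblings born
    outside the period receive an expected value of zero.
    """
    in_period = [start <= year <= end for year in birth_years]
    if not in_period[patient_index]:
        raise ValueError("the patient must be born within the period")
    n_eligible = sum(in_period)
    observed = [Fraction(int(i == patient_index))
                for i in range(len(birth_years))]
    expected = [Fraction(1, n_eligible) if eligible else Fraction(0)
                for eligible in in_period]
    return observed, expected


# Siblings born in 1949, 1950, 1952 (the patient) and 1955; period 1950-1954.
observed, expected = score_within_period([1949, 1950, 1952, 1955], 2, 1950, 1954)
print("observed:", [str(x) for x in observed])
print("expected:", [str(x) for x in expected])
```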


This method is suitable only in the study of conditions diag- 
nosed at, or shortly after, birth. McKeown and Record explored 
its application in theoretical and practical situations. The method 
gives good results on both complete and incomplete sibships in 
the theoretical models, and in practice does not give the spurious 
later birth rank effect given by the Greenwood-Yule method on 
incomplete sibships. However, in a number of actual situations, 
the method failed to demonstrate a birth order or maternal age 
effect known to be present. This failure is attributed to the
method being readily disturbed by secular changes in disease in- 
cidence or ascertainment occurring over the period of collection 
of the data. The method does not seem to give sufficiently con- 
sistent results to supplant the collection of information on the re- 
lated population when the latter is possible. 

Parental age. Association of a disease or characteristic with age 
of either or both parents may be assessed by a similar compari- 
son of observed and expected distributions. The expected distri- 
bution is derived by apportioning probabilities of the occurrence 
of the affected individual in accordance with the total number of 
children in the sibship born at each of the parental ages. Thus a
family of four, born when the mother was 22, 26 (affected), 28 and 
33 (affected) would be scored as follows: 


                   Maternal age
              20-24   25-29   30-34   Total
Observed        0       1       1       2
Expected       1/2      1      1/2      2
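
The apportionment by maternal age can be sketched in the same way. The function names and the five-year grouping helper are ours; the expected value for each age group is the number of affected individuals in the sibship multiplied by the proportion of the sibship born in that group.

```python
from collections import Counter
from fractions import Fraction


def age_group(age, width=5):
    """Label of the five-year maternal age group containing the given age."""
    lower = (age // width) * width
    return f"{lower}-{lower + width - 1}"


def score_by_maternal_age(ages_at_births, affected_flags):
    """Observed and expected contributions of one sibship, by maternal age group."""
    groups = [age_group(age) for age in ages_at_births]
    births = Counter(groups)
    affected = Counter(g for g, flag in zip(groups, affected_flags) if flag)
    n_affected = sum(affected.values())
    size = len(groups)
    observed = {g: affected.get(g, 0) for g in births}
    expected = {g: Fraction(n_affected * k, size) for g, k in births.items()}
    return observed, expected


# Children born when the mother was 22, 26 (affected), 28 and 33 (affected).
observed, expected = score_by_maternal_age([22, 26, 28, 33],
                                           [False, True, False, True])
print("observed:", observed)
print("expected:", {g: str(v) for g, v in expected.items()})
```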


Consideration of incomplete sibships, twins, and deceased sib- 
lings is similar to that outlined in the discussion of birth order. 


Separation of Variables 


Paternal age, maternal age, and birth order are closely asso- 
ciated. A relationship with any one of them is likely to result in 
secondary associations with the others. For example, in the study 
of pyloric stenosis to which reference has already been made 
(page 194), associations with both birth order and age of mother 
were found. The data are shown in Table 31. The column totals 
show that incidence decreased with birth order from 4.3 per 
thousand in first births to 1.4 per thousand in fourth and later 
births, and the final column shows that incidence decreased with 
maternal age from 4.0 in mothers aged 17-21 to 2.6 in mothers 
aged 37 and over. Either one of these associations could be sec- 
ondary to the other, since of course first births constitute a larger 
percentage of all births in younger mothers than in the older age 
groups. The fact that the decrease is more marked with birth
order than with maternal age suggests that birth order may be 
the primary variable involved. However, this is no more than a
suggestion, since the degree of apparent association will depend 
in part on the grouping adopted, and the grouping used may be 
more satisfactory for the one variable than for the other. 


TABLE 31


Incidence of infantile pyloric stenosis per 1000 livebirths, related to
birth order and maternal age, Birmingham, England, 1942-1949*


Maternal                    Birth order
age
(years)        1       2       3       4+      Total

17-21         4.6     1.8      ?       —       4.0
22-26         4.0     2.6     2.9     1.7      3.3
27-31         2.4     2.2     2.2     2.2      2.9
32-36         4.4      ?      2.0     1.0      2.0
37+            ?      6.1     3.1     1.2      2.6
Total         4.3     2.8     2.5     1.4      3.03


* From McKeown et al., 1951.


As noted elsewhere, examination of the independent effects of 
two variables requires examination of association with each of 
them while the other is held constant. The body of Table 31 
shows the trends with maternal age that are evident in each birth 
order (reading down the columns) and the association with birth 
order evident in each maternal age group (reading along the rows). 
When birth order is held constant the inverse association with 
maternal age evident in the total is no longer seen. On the other 
hand, the downward trend with birth order is evident in each 
maternal age group, as well as in the total. This suggests that the 
association with birth order is independent of maternal age, but 
that the downward trend with maternal age results from the 
association with birth order. With regard to the maternal age 
association, there is, in fact, a suggestion in the data for second 
births of a maternal age effect directly opposite to that seen in the
total for all birth orders. 
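
How a secondary association of this kind arises can be shown with a small numerical sketch. The counts below are hypothetical, chosen by us purely for illustration and not taken from the Birmingham data: incidence depends only on birth order, yet the totals by maternal age show a downward trend because first births are concentrated among younger mothers.

```python
# Hypothetical counts: (maternal age group, birth order) -> (cases, births).
counts = {
    ("younger", "first"): (32, 8000),
    ("younger", "later"): (4, 2000),
    ("older", "first"): (8, 2000),
    ("older", "later"): (16, 8000),
}


def rate_per_1000(cases, births):
    return 1000 * cases / births


def marginal_rates(counts, axis):
    """Rates for the totals of one variable (axis 0 = age, axis 1 = birth order)."""
    totals = {}
    for key, (cases, births) in counts.items():
        c, b = totals.get(key[axis], (0, 0))
        totals[key[axis]] = (c + cases, b + births)
    return {level: rate_per_1000(c, b) for level, (c, b) in totals.items()}


cell_rates = {key: rate_per_1000(*cb) for key, cb in counts.items()}
by_age = marginal_rates(counts, axis=0)
by_order = marginal_rates(counts, axis=1)

print("within strata:", cell_rates)
print("totals by maternal age:", by_age)
print("totals by birth order:", by_order)
```

Within each birth order the rate is the same for younger and older mothers, while the totals by maternal age differ; holding birth order constant removes the apparent maternal age effect.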

In addition to the separation of parental age from birth order, 
there is also the important and even more difficult question of 
separating the effects of maternal and paternal age. This has 
been attempted by Yerushalmy (1938) in a study of neonatal 
mortality, and recently by Novitski and Sandler (1956) in a 
study of sex ratio. In both cases, an influence of paternal age independent
of maternal age was noted, although in the case of
neonatal mortality the association with maternal age was more 
marked than that with father’s age. The Greenwood-Yule method 
can also be modified to separate the effects of parental age and 
birth order, as described by Penrose (1934), but the computation 
is complex and laborious; apart from its initial utilization by Pen- 
rose on mongolism, the method has rarely been used. 


Effect of Variation in Fertility 


Certain forms of variation in parental fertility may produce 
characteristic birth order or maternal age associations. Particu- 
larly, the following possibilities should be considered: 

(1) Does the disease under study tend to occur in smaller 
or larger than average families? In such a situation the Green- 
wood-Yule method will not suggest an association with birth or- 
der unless one is present independently of the association with 
family size. However, by comparison with the related population 
of births, a characteristic occurring in small families will appear 
to be more common in the lower birth orders, and one occurring 
predominantly in large families will show a positive association 
with increasing birth order. The patient series may be compared 
with the related population within given family sizes, to deter- 
mine whether any association with birth order exists when family 
size is held constant.

(2) Following the birth of a child affected with the condi- 
tion under study is there voluntary or involuntary limitation of 
further reproduction of the parents? Voluntary limitation might, 
for example, follow the birth of children with certain congenital 
malformations or follow multiple births, and involuntary limita- 
tion might be associated with conditions such as ectopic preg- 
nancy. The effect of such a circumstance on birth order as studied 
by the Greenwood-Yule method is the same as that of the exami- 
nation of incomplete families — increased incidence in the later 
birth orders will necessarily result. Furthermore, since the age 
of the parents of such families may be such as to ensure that un- 
der ordinary circumstances their reproduction would be complete, 
there may be no way of determining whether a birth order asso- 
ciation of this type revealed by the Greenwood-Yule method 
results from family limitation or not. The reverse situation in
which the birth of an affected individual leads to increased sub- 
sequent fertility also produces a spurious birth order effect with 
the Greenwood-Yule method, this time suggesting increased inci- 
dence in the early birth orders. This situation may arise in studies 
of conditions resulting in stillbirth or early death. 

Comparison of affected individuals with the related population 
of births, or with a sample of the same, eliminates birth order 
associations arising from changes in fertility subsequent to the 
birth of the affected individuals, provided the condition under 
study is not familial. If the condition is strongly familial, family 
limitation after the birth of an affected individual may result 
in an apparently increased incidence in the early birth ranks. 
However, the data collected in a related population group fre- 
quently enable comparison of the affected families with the fam- 
ilies in the related population with respect to number of chil- 
dren born between the time of birth of the index child and the 
time of the study. Such a comparison will detect the possibility of 
this source of error. The data necessary for this examination are 
not likely to be available in studies of national and other large 
scale statistics. 


INTERPRETATION 


If the frequency of a condition is related to birth order or 
parental age, it may be assumed that some environmental influ- 
ence is of importance in the etiology of the condition, since the 
genetic processes of segregation and random assortment are theo- 
retically independent of age and parity, and genetically deter- 
mined characteristics should therefore be distributed at random 
to all children of given parents regardless of their birth order or 
of the parents’ ages. Such evidence of environmental influence 
may alone justify studies of these variables, even if no more 
specific interpretation is possible, since it may contribute importantly
to concepts of the relative roles of genetic and environmental
factors in the etiology of the disease in question. Age or
birth order associations may be found in conditions in which ge- 
netic factors are known to be important, for example, harelip in 
the mouse (Reed, 1936) and polydactyly in the guinea pig (Holt, 
1948). In such cases, however, the existence of environmental influences
modifying expression of the genetic potentialities must
be presumed. The investigation of the environmental “modifiers” 
may then have more immediate practical potential than further 
attempts at elucidation of the genetic make-up. 

It should be noted parenthetically that association with birth 
order or maternal age does not indicate that the resultant envi- 
ronmental influences are necessarily prenatal in timing. Many 
social and other characteristics of the postnatal and later child- 
hood environments are undoubtedly related to these variables. 
Naturally, any environmental influence in the etiology
of congenital disorders adduced through birth order and maternal
age associations must be prenatal by definition.


Types of Mechanisms 


Three general types of mechanisms that are explanatory of 
birth order and maternal age associations may be recognized: 
(1) nongenic intracellular inheritance, (2) maternal structural and 
functional changes associated with changes in age and parity, and 
(3) social determinants of fertility patterns. 

Nongenic inheritance. The mechanisms involving nongenic in- 
tracellular inheritance are probably the least well understood. In 
a number of the known examples of nongenic inheritance in the 
laboratory animal, such inheritance has been shown to be related 
to age of parents. For example, Lansing (1947, 1954) has shown 
for the rotifer that length of life of the offspring is related to the 
age of the parent at the time when the egg is produced. The presence
of an “aging factor” which increases in the body of the parent
and can be transmitted to the offspring via the egg is postulated. 
A similar type of relationship has been found by Strong (1954) 
in studying the latent period of fibrosarcoma development fol- 
lowing injection of methylcholanthrene into mice. 

No mechanisms of this type have been postulated for man, 
though average length of life in man has been shown to be shorter 
for the children of older than of younger parents (Beeton and 
Pearson, 1901; Dublin and Marks, 1941; Jalavisto, 1950). 

Maternal changes. Variation in maternal anatomy, physiology
and psychology has evident effects on the intrauterine and postnatal
environment of the offspring. This variation may be related
to age and parity. The association of the prevalence of multiovular
pregnancy with maternal age must be presumed to have
either an anatomic or a physiologic basis. It has been postulated 
that the development of mongolism may be in part dependent on 
local structural anomalies of the maternal uterus which are more 
frequent in older mothers. The evidence to support this ex- 
planation with respect to mongolism is meager, but mechanisms 
of this type are readily conceivable, for example, with respect to 
syndromes resulting from injury during the birth process. 

There are also maternal changes, independent of age, associ- 
ated with changes in parity, for example, local anatomic changes 
affecting the uterus. Such changes are likely explanations of the 
positive association between birth order and birth weight of the 
offspring. Changes in maternal psychology affecting the postnatal 
environment of the infant may be related to age or birth order. 
It seems possible that the high incidence of infantile pyloric 
stenosis in first births may be explained in this way, since post- 
natal environmental influences affecting the development of the 
disease seem to be closely related to maternal care (McKeown, 
et al., 1955). In this connection, a potentially fruitful field of in- 
quiry would seem to be in more comprehensive examination of 
the influence of order of birth and parental age in the determina- 
tion of personality traits. 

Social determinants. Social factors productive of birth order 
and parental age associations are also complex. A distinction must 
first of all be made between associations which identify the dis- 
ease as occurring in sibships belonging to a certain (social) type 
and associations which result from changes occurring within the 
sibship. 

For example, if a cross section of all births occurring in a com- 
munity is examined, first births will be found to be on the aver- 
age about a quarter of a pound lighter than second births, which 
in turn will be lighter than third and later births. This could re- 
sult from increasing birth weight being directly related to in- 
creasing birth order, or from a tendency for women of limited 
fertility to have smaller babies. In the latter case the early birth
ranks would include a high proportion of births to relatively infertile
women, and consequently a high proportion of small babies. The
distinction between these two types of relationship can in this
case be made quite readily by examination of the mean weights
of first and second births of the same mothers. It has in fact been 
found that second births are on the average heavier than are the 
first births of the same mothers, which makes unnecessary the 
alternative explanation—a possible association between birth 
weight and infertility. However, examination of successive births 
to the same mothers is not always easy. So long as a characteristic 
common to all births (e.g., birth weight) is being considered, num- 
bers of serial observations need not be large, but the difficulty may 
be insurmountable when a larger number is required, such as in 
the examination of a specific disease. 

A further example of this situation is provided by the fact that 
patterns of reproduction commonly vary with economic status. 
Thus, in Western countries, it is common for women in the lower 
economic groups to reproduce more frequently and at earlier ages 
than women in the middle and wealthy classes. Consequently in 
a cross section of births, births in the higher birth orders and 
lower maternal ages, and particularly births of higher than aver- 
age birth order for a given maternal age, will be unduly weighted 
with births to women in the lower economic groups. A disease 
which is associated with economic status will therefore tend nec- 
essarily to show an association with age and birth order. Factors 
of this type undoubtedly enter into the complex patterns of par- 
ental age and birth order associations manifested by fetal and in- 
fant death rates, although, as noted above, it is not easy to de- 
termine which features of such patterns would be found in an 
economically homogeneous group of families. 

There are also social circumstances which may be peculiar to 
the sibship. For example, parental social standing, income and 
child-rearing practices are unlikely to remain static within the 
sibship as age and parity increase. Further, being born into a sib- 
ship already containing children is a determinant of the timing 
of the common communicable diseases among the later-born sib- 
lings. 

Much of this discussion of factors explanatory of birth order or 
parental age associations has been speculative. Although Duncan 
pointed out the role of maternal age in the frequency of twins as 
far back as 1865, and Galton (1892) aroused considerable inter- 
est in the importance of order of birth toward the end of the last 
century, the total amount of statistically adequate work that has
been done with these variables has been relatively small and 
highly concentrated in a few areas such as twins, mongolism, 
pyloric stenosis and fetal and infant death. The accumulation of 
information referable to a broader spectrum of outcome will pre- 
sumably lead to increasing clarification of explanatory mechan- 
isms. 


LITTER SIZE 


Discussion of prenatal influences would be incomplete without 
mention of one other variable that may be considered descriptive 
of the prenatal environment — litter size. That the intrauterine 
environment is vastly different for twins than for single births 
perhaps needs no authentication, but is demonstrated by the 
striking decrease in birth weight associated with increasing litter 
size (Figure 36). It seems possible that environmental influences 
capable of producing such striking variation in birth weight may 
also be capable of producing other characteristics and disorders, 
particularly those that may arise during the later months of preg- 
nancy when the crowding would appear to be most severe. 

The method of determining whether twins are abnormally fre- 
quent in a particular disease has usually been to assemble a large 
series of cases of the defect, to note how many of the affected 
children were born of a twin pregnancy, and to compare the ob- 
served number of twins with an expected number. The expected 
frequency of a twin confinement among a typical group of white 
births in the United States is approximately 1 per cent; among a
typical group of newborn infants the proportion of twins will 
therefore be approximately 2 per cent. Adjustment for birth or- 
der and maternal age may be necessary, since the frequency of 
twins among confinements is known to be influenced by these 
variables. If the proportion of twins among a series of affected in- 
dividuals differs significantly from that expected, the condition 
may accordingly be more or less common in twins than in single 
births. This inference is justifiable when dealing with a series of 
cases assembled at birth and in which both stillborn and liveborn 
affected individuals are included. However, since the perinatal 
mortality of twins is higher than that of single births, some adjustment
is required if the series is restricted to livebirths or to
the neonatal period. Since little is known of the mortality of
twins after the neonatal period, adjustment may not be possible
if the series is assembled at a time distant from birth. This circumstance
has limited the use of this type of investigation almost
exclusively to studies of the congenital malformations.

Figure 36. Median weight of livebirths from single, twin and triplet pregnancies
by sex, United States, white births, 1957.
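
The comparison of observed with expected twins described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the series of 500 cases with 20 twins is hypothetical, triplets and higher orders are ignored, no adjustment is made for birth order or maternal age, and cases are treated as independent (which is not strictly true when both members of a twin pair are affected).

```python
from math import comb


def twin_share_of_infants(twin_confinement_rate):
    """Proportion of infants who are twins, given the proportion of
    confinements that are twin confinements (triplets etc. ignored)."""
    twins = 2 * twin_confinement_rate
    singles = 1 - twin_confinement_rate
    return twins / (twins + singles)


def upper_tail_probability(n_cases, observed_twins, p_twin):
    """Exact binomial probability of observing at least `observed_twins`
    twins among `n_cases` independent cases."""
    return sum(
        comb(n_cases, k) * p_twin ** k * (1 - p_twin) ** (n_cases - k)
        for k in range(observed_twins, n_cases + 1)
    )


# About 1 per cent of confinements are twin confinements (from the text),
# so roughly 2 per cent of infants are twins.
p_twin = twin_share_of_infants(0.01)

# Hypothetical series: 500 affected children, 20 of them twins.
n_cases, observed_twins = 500, 20
expected_twins = n_cases * p_twin
tail = upper_tail_probability(n_cases, observed_twins, p_twin)

print(f"proportion of infants who are twins: {p_twin:.4f}")
print(f"expected twins among {n_cases} cases: {expected_twins:.1f}")
print(f"P(at least {observed_twins} twins by chance): {tail:.4f}")
```

For a series restricted to livebirths or assembled after the neonatal period, `p_twin` would first have to be adjusted for the higher mortality of twins, as the text notes.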


15 
Analytic Epidemiology: 
Cohort Studies 


Cohort studies form part of the general area of analytic epi- 
demiology, being studies designed predominantly to test a specific 
hypothesis or a number of specific hypotheses. As noted in Chap- 
ter 4, cohort studies involve the longitudinal study of groups iden- 
tified in some way other than by exhibiting the characteristic 
whose etiology is being investigated. These groups are defined 
through division into those possessing and those not possessing 
some exposure or characteristic prior to the appearance of the dis- 
ease or characteristic under investigation. 


IDENTIFICATION OF STUDY GROUPS 


An important methodologic difference exists between cohort 
studies in which the study group is the general population, or 
some readily accessible part of the general population, and those 
in which the study group is a group identified as having been ex- 
posed to some special risk. In the first case, data are required to 
separate the general population into subgroups according to level 
of exposure to various factors. The major problem in the second 
situation is the identification of nonexposed groups whose experi- 
ence can be compared with that of the special exposure group. 

Like most of the classifications of epidemiologic methods that 
we have attempted, the division of cohort studies into those of 
general populations and those of special exposure groups is not
invariably clear-cut. For example, the follow-up studies of the 
populations of Hiroshima and Nagasaki are studies of general 
populations that might equally well be regarded as special ex- 
posure groups. However, since these studies followed a sequence 
of first identifying a general population and then collecting data 
to stratify the population according to level of exposure, for pur- 
poses of classifying the epidemiologic method involved they 
may be considered to resemble general population studies more 
than special exposure group studies. 


Studies of General Population Groups 


A major difficulty of cohort studies of general populations is 
the large number of persons who must be followed to obtain ade- 
quate numbers of persons developing any particular disease dur- 
ing the course of the study. For example, in order to obtain 50 
deaths from coronary heart disease during the course of a year’s 
study, it would be necessary to follow 25,000 persons, on the basis 
of current death rates for this disease in the United States. Even 
if the study were restricted to the groups showing the highest 
rates, say, males aged 35 years or older, it would still be necessary 
to follow 10,000 persons to obtain 50 deaths. And, it should be re- 
called, coronary heart disease is one of the commonest disease 
manifestations in the United States at the present time. For most 
diseases, therefore, this problem would be much greater. For ex- 
ample, in order to expect 50 cases of leukemia during a year's 
study, it would be necessary to follow nearly one million people. 
These numbers can be reduced by longer follow-up, but prolonged 
follow-up raises its own problems. 
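
The arithmetic behind these cohort sizes is simply the target number of events divided by the expected rate. The sketch below is illustrative only: the annual rates per 100,000 are back-calculated from the cohort sizes quoted above rather than taken from any table, and the function name `persons_needed` is our own. It ignores losses to follow-up and the aging of the cohort.

```python
import math


def persons_needed(target_events, annual_rate_per_100k, years=1):
    """Approximate cohort size needed to expect `target_events` events,
    assuming a constant annual rate and no attrition."""
    return math.ceil(target_events * 100_000 / (annual_rate_per_100k * years))


# Assumed annual rates per 100,000, implied by the figures in the text.
assumed_rates = {
    "coronary heart disease, all persons": 200,
    "coronary heart disease, males 35 and over": 500,
    "leukemia": 5,
}

for condition, rate in assumed_rates.items():
    one_year = persons_needed(50, rate)
    five_years = persons_needed(50, rate, years=5)
    print(f"{condition}: {one_year:,} persons for 1 year, "
          f"{five_years:,} persons for 5 years")
```

Lengthening the follow-up reduces the required cohort in proportion, which is the trade-off against the problems of prolonged follow-up mentioned above.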

Nevertheless, a number of general population studies have 
been attempted. Some of the different types of groups followed 
have been: 


(1) Whole communities
(2) Occupational subgroups
(3) Medical care subgroups
(4) Insured persons
(5) Self-selected groups
(6) Obstetric populations

(1) Whole communities. To study the occurrence of arteriosclerotic
and hypertensive heart disease, a cohort study of the
population of Framingham, Massachusetts, was established in 
1950 (Dawber et al., 1951). Framingham was, at that time, a 
town of 28,000 inhabitants in eastern Massachusetts, sufficiently 
distant from Metropolitan Boston to retain its identity as a sepa- 
rate community and sufficiently large to be expected to yield an 
adequate number of cases. The study was restricted to the group 
expected to show the highest incidence — males, aged 30 to 60 
years. The study was further restricted to a two-thirds sample of 
males aged 30 to 60. However, since the sample was intended to 
be a random one, the study was clearly meant to yield findings 
representative of males aged 30 to 60 in the whole community. 
Of the 6510 persons eligible for inclusion in the sample, 4469 
were finally included in the study. 

The Atomic Bomb Casualty Commission has carried out fol- 
low-up studies on the very large numbers of the Hiroshima and 
Nagasaki populations who survived the atomic bombs of 1945 
(Lange et al., 1954). In these studies, 125,041 and 111,135 people 
in the respective cities were identified; particular interest has cen- 
tered on the manifestation of leukemia. 

(2) Occupational subgroups. The occupations referred to 
here are not those known or believed to confer special risk of the 
disease under investigation. They are occupational groups that, 
with respect to the disease in question, are not known to be un- 
representative of the general population in any relevant respect 
and which offer special facilities for the collection of data or fol- 
low-up. For example, Doll and Hill (1954) studied physicians 
on the Medical Register of the United Kingdom in 1951. These 
physicians represented a subgroup of the population who, be- 
cause of the legal obligation to maintain registration, were rela- 
tively easy to identify and trace, who might be expected to receive 
better than average medical care (and hence diagnosis), and who 
moreover might sympathize with the objectives of the study. The
frequency of lung cancer, the manifestation of interest in the in- 
quiry, was not unrepresentative of that of the general population. 
Doyle et al. (1957) describe a cohort study of male civil service 
employees in Albany, New York, who because of stability of occupation
and accessibility seemed a suitable group among whom
to study the occurrence of heart disease. 

Workers in industrial plants also provide populations that are 
readily accessible and easily followed. However, the number of 
persons in any one plant is unlikely to be sufficiently large for the 
study of specific diseases, unless some special risk is present in the 
plant. The records of unions (particularly those providing medi- 
cal care programs) may be useful for this purpose, although they 
have been little used so far. 

(3) Medical care subgroups. Persons enrolled in prepaid 
medical care plans such as union-sponsored medical care programs 
and the Health Insurance Plan of Greater New York are also 
more readily followed than the general population. The prepay- 
ment component of such plans not only insures that those en- 
rolled will seek the majority of their medical care inside the pro- 
gram, but also provides, through payment of dues, a mechanism 
for annual follow-up. 

(4) Insured persons. The many studies of life insurance sta- 
tistics might be classified as cohort studies. Persons taking out 
insurance policies between certain dates constitute the enter- 
ing cohort, whose subsequent mortality is measured by claims. A 
special study utilizing such a population group is that of Dorn 
(1959b) on the effect of smoking on mortality among policyhold- 
ers of U.S. government life insurance. These policyholders were 
veterans, primarily from World War I. The cohort consisted of 
all such policyholders known to be alive in July 1954. This study 
differs from most insurance studies in that entrance into the co- 
hort was determined not by the time when the insurance was taken 
out, but by virtue of its members being alive at a specified time, 
the insurance having been taken out at some time in the past. 

(5) Self-selected groups. Although self-selection enters to 
some extent into all the types of study described above, there are 
some groups made up entirely of volunteers. For example, Ham- 
mond and Horn (1958) assembled a population of 189,854 white 
men between the ages of 50 and 69 by asking 22,000 American 
Cancer Society female workers (themselves volunteers) to have a 
questionnaire filled out by about 10 such men “whom she knew 
well and would be able to trace.” 

An attractive feature of self-selected groups such as these is the
relative ease of obtaining detailed information, since willingness 
to supply information is implied by the person’s agreement to 
enter the study. The possibility of bias resulting from self-selec- 
tion, however, is a major problem. For example, it is possible 
that the American Cancer Society volunteers, suspecting
that smoking was injurious to health, might have tended to select
men who not only were agreeable to entering the study, but who 
also were heavy smokers and in lower than average states of 
health. Although individuals who had gross illness were excluded 
from the cohort, this might not eliminate the difficulty entirely. 
Such a bias, it may be noted, would be most likely to have an 
effect early in the follow-up; the longer the interval between the 
identification of the cohort and the observation of an associated 
morbid outcome, the less chance there is that the association is 
due to an initial bias in selection of cohort members. 

(6) Obstetric populations. Possible roles of prenatal experi- 
ences in the production of defect or disease noted at birth are 
ideal subjects for cohort studies. In the first place, the period of 
follow-up is short. Secondly, the delivery and recording of out- 
come are frequently the responsibility of the same agency (phy- 
sician or hospital) as that which undertook the measurement of 
exposure during pregnancy. The two circumstances, indeed, are 
often recorded in the same medical record. 

The Cerebral Palsy Collaborative Study being conducted by 
the National Institute of Neurological Diseases and Blindness is 
an extensive study of this type. The cohort consists of approxi- 
mately 40,000 women registered during pregnancy at one of fif- 
teen hospitals located throughout the United States. The offspring 
of these pregnancies will be observed for at least the first 6 years 
of life (Masland, 1958).

Other cohort studies have involved pregnant women registered 
in individual hospitals and observed at delivery for evidence of 
fetal defect (McDonald, 1958). 

Data on exposure. After the general population group to be 
studied has been identified, certain data must be collected regard- 
ing its individual members. Data will be required: 

(1) to classify the group into subgroups according to level 
of exposure to the factor or factors whose etiologic influence is
being investigated;

(2) to determine whether the exposure groups so derived 
are similar with regard to such demographic variables as age, sex, 
and race which might influence occurrence of the disease under 
investigation; and 

(3) if they are dissimilar in one or more of these respects, 
to enable allowance for such differences in the analysis. 

Such data must be available not only for those who subse- 
quently develop the disease manifestation in question but for 
the whole cohort. This problem is perhaps simplest in studies 
using obstetric data, where the records often routinely include 
data on the occurrence of such events during pregnancy as threat- 
ened abortion, hormone therapy, maternal chronic disease, pre- 
natal X-rays, and a variety of demographic data such as race, 
maternal age, and parity. For the determination of information 
which would not be expected in routine records, direct access to 
the pregnant women may be needed. In the Cerebral Palsy Col- 
laborative Study, for example, special interviews with mothers 
are being conducted together with comprehensive physical ex- 
aminations and biochemical and other laboratory tests. 

When special data are required on less accessible populations 
the method is more difficult. For example, the cohort studies of 
lung cancer previously mentioned required detailed information 
on smoking habits. In the American Cancer Society study, and in 
studies of insurance data, such data can be obtained from the in- 
dividuals when they enter the cohort. However, this advantage 
in ready access to data on level of exposure is more than counter- 
balanced by the self-selected nature of the group. Doll and Hill 
(for the British physicians) and Dorn (for the American veter- 
ans) obtained the data by mail. Response to a request for infor- 
mation was almost identical in the two studies — 68 per cent of 
the replies were usable; however, Dorn was able to increase this 
figure to 83 per cent by a second questionnaire to those who had 
failed to answer the first. 

If data are not obtained for a large proportion of the cohort, 
the question of bias arises just as in self-selected cohorts. This
problem increases as the data of interest become more personal.
For example, in the Framingham population sample certain of 
the data could only be obtained by physical examination and 
these examinations were not obtained for 31 per cent of the per- 
sons in the sample. The primary reason for failure was lack of 
cooperation on the part of the person to be examined. Almost 
certainly the 31 per cent of the sample population for whom the 
data were not obtained differed from those for whom they were 
obtained in demographic, psychological and physical characteris- 
tics. 

The problem of self-selection entering into such studies is usu- 
ally not as desperate as in those in which the whole cohort is 
self-selected, since there is often some opportunity to assess the 
nature of the bias that has entered into the selection and to judge 
its possible influence on the results obtained. 

This assessment consists, first, of comparison of respondents 
with nonrespondents with respect to such descriptive data (possibly
available from other sources) as age, sex and race. This information
might be found in available records, for example, of
veterans or industrial groups, or might be obtained by inference. 
For example, if the age distribution of a community’s population 
is known and the age distribution of respondents is subsequently 
obtained, the age distribution of the nonrespondents can be ob- 
tained by subtraction. It is important also to compare respondents 
and nonrespondents with respect to outcome, if this can be ob- 
tained for nonrespondents. 
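
The inference by subtraction mentioned above can be illustrated as follows. The age groups and counts are entirely hypothetical, and the function names are ours; the sketch assumes the community counts and the respondent counts use the same age groupings and refer to the same eligible population.

```python
def nonrespondent_counts(population_counts, respondent_counts):
    """Derive nonrespondent counts per group by subtracting respondents
    from the known population counts."""
    result = {}
    for group, total in population_counts.items():
        responded = respondent_counts.get(group, 0)
        if responded > total:
            raise ValueError(f"more respondents than population in {group}")
        result[group] = total - responded
    return result


def percent_distribution(counts):
    """Convert counts per group into percentages of the overall total."""
    total = sum(counts.values())
    return {group: 100 * n / total for group, n in counts.items()}


# Hypothetical counts by age group.
population = {"30-39": 2400, "40-49": 2200, "50-59": 1900}
respondents = {"30-39": 1500, "40-49": 1600, "50-59": 1400}

nonrespondents = nonrespondent_counts(population, respondents)
resp_pct = percent_distribution(respondents)
nonresp_pct = percent_distribution(nonrespondents)

for group in population:
    print(f"{group}: respondents {resp_pct[group]:.1f}%, "
          f"nonrespondents {nonresp_pct[group]:.1f}%")
```

A marked difference between the two percentage columns would suggest that response was related to age, and the same comparison could be repeated for any other characteristic known for the whole population.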

With respect to the yield likely from various sources of infor- 
mation, the Hunterdon County survey is of interest (Trussell et 
al., 1956). In this survey information concerning medical status 
was compared according to source. The percentages of persons for 
whom information was obtained from the various sources were as 
follows: by interview at home, 91 per cent; by medical examina- 
tion, 72 per cent; by mail questionnaire, 56 per cent; by multi- 
ple screening for chronic disease, 34 per cent. Naturally, such 
figures will vary according to the information sought, the group in 
question, and a wide variety of other factors. In one study of a relatively
small employment group, 89 per cent of the eligible employees
underwent a comprehensive medical examination (Doyle
et al., 1957).


Such studies as the one in Framingham, involving periodic interviews
and examinations of members of the cohort, allow opportunities
for periodic revision of exposure classes according to
changes in information concerning its members. In other types of 
cohort study, such as those of lung cancer, only one assessment 
of exposure may be made — that at the beginning of the study. 
Appreciable groups of individuals, however, may change their ex- 
posure class between the time of assessment and the end of a 
long-term study. In long studies periodic reassessment provides a 
way of overcoming this problem. In any event, it should be noted 
that changes which occur but which remain unassessed will tend 
to reduce any actual association that exists between exposure and 
development of the disease. In other words, the real association is 
likely to be greater than that found. In fact, a conservative ap- 
proach, which leans to reduction rather than exaggeration of as- 
sociation, would suggest that it is preferable to utilize the initially 
determined exposure groups throughout rather than to risk the 
alteration of an exposure group being influenced by later knowl- 
edge of outcome. In studies of exceptionally long duration this 
generalization may not be applicable. However, in such a situa- 
tion, separate examination of individuals who have changed ex- 
posure groups during the course of the study would still be in- 
dicated. 
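
The statement that unassessed changes of exposure class tend to reduce an observed association can be checked with a small calculation. The sketch below rests on a deliberately crude assumption, namely that a person who changes class experiences the risk of the class he has moved to; the risks and the proportions changing class are hypothetical.

```python
def observed_relative_risk(risk_exposed, risk_unexposed,
                           frac_exposed_who_change,
                           frac_unexposed_who_change):
    """Relative risk seen when groups are kept as classified at entry,
    although some members have in fact changed exposure class."""
    labeled_exposed = ((1 - frac_exposed_who_change) * risk_exposed
                       + frac_exposed_who_change * risk_unexposed)
    labeled_unexposed = ((1 - frac_unexposed_who_change) * risk_unexposed
                         + frac_unexposed_who_change * risk_exposed)
    return labeled_exposed / labeled_unexposed


# Hypothetical risks over the study period: true relative risk is 2.0.
risk_exposed, risk_unexposed = 0.02, 0.01
true_rr = risk_exposed / risk_unexposed

# Suppose 20 per cent of the exposed give up the exposure and
# 10 per cent of the unexposed take it up after the initial assessment.
diluted_rr = observed_relative_risk(risk_exposed, risk_unexposed, 0.2, 0.1)

print(f"true relative risk:     {true_rr:.2f}")
print(f"observed relative risk: {diluted_rr:.2f}")
```

Under these assumptions the observed relative risk lies between 1 and the true value, so the association is understated rather than exaggerated.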


Studies of Special Exposure Groups 


Cohort studies of general population groups yield certain in- 
formation that can be obtained in no other way, and when the 
factor under examination is one to which a large proportion of 
the population is exposed (e.g., cigarette smoking), they may be 
the most economical form of cohort study possible. However, 
when the factor under examination is one to which only a small 
proportion of the population is exposed, greatest value for money 
is obtained by identifying groups who have experienced some un- 
usual exposure and comparing their disease experience with other 
selected groups. 

A number of such studies have been made in recent years of 
persons receiving large doses of X-rays for medical reasons, for
example, for ankylosing spondylitis (Court-Brown and Doll, 1957)
or for thymic enlargement in infancy (Simpson et al., 1955), to determine
whether such groups experience greater than average
risks of leukemia and other cancers. The study of Court-Brown
and Doll is of particular interest in that the cohort was assem- 
bled retrospectively. Assembled in 1955, the cohort consisted of 
13,352 patients given X-ray therapy for ankylosing spondylitis in 
one of eighty-one British therapy centers between 1934 and 1954. 
The outcome measured was the occurrence of death from leuke- 
mia or aplastic anemia at some time between 1935 and 1954. An- 
other example of retrospective definition of a cohort is the study 
by Case and Lea (1955) of the mortality of a cohort of men de- 
fined in 1930 by virtue of receiving pensions for the effects of 
mustard gas poisoning in World War I (see page 46). This ap- 
proach adds the major practical advantage of the case history 
method — immediate availability of data — to the theoretical ad- 
vantages of the cohort study. It is an efficient and economical 
method and is being used increasingly. 

Other examples of cohort studies of special exposure groups 
include follow-up studies of infants delivered prematurely and of 
specially exposed industrial populations such as the investigation 
by Case et al. (1954) of workers in the dyestuffs industry. 

Selection of comparison groups. As for cohorts drawn from general
populations, it is necessary to assemble certain data on the
members of a cohort of specially exposed persons. In the first 
place, data on age, sex and racial distribution (and possibly 
other features) are necessary for the interpretation of disease 
rates. Secondly, while it may be assumed that all members of the 
cohort are exposed, it may be desirable to attempt a stratification 
according to degree of exposure. For example, Court-Brown and 
Doll attempted to determine the dose of X-rays received by the 
spinal marrow for each of the patients in the cohort with ankylos- 
ing spondylitis. The discussion of methods of assembling such 
data in general population studies applies also to studies of spe- 
cial exposure groups. 

Unlike general population studies, special exposure cohorts re- 
quire the selection of a group of nonexposed persons with which 
the outcome in the specially exposed group can be compared. 
The problem is analogous to that of selection of comparison 
groups in case history studies, although the types of groups se- 
lected are generally different. There are two possibilities: 

(1) Comparison may be made with rates existing in the
general population at the time the cohort is being followed. 

(2) Comparison may be made with the experience of another 
special cohort, similar in demographic characteristics to the ex- 
posed group, but not exposed. 

In comparisons of the specially exposed cohort with the gen- 
eral population, as done for example by Court-Brown and Doll 
with respect to risk of death from leukemia or aplastic anemia, 
rates of the specific outcome in the cohort may be compared with 
those in the general population. However, since it is nearly always 
necessary to make the comparison specific for age and sex, most 
specially exposed cohorts will be too small for the derivation of 
reliable rates. 

It is more usual to compare the observed outcome (say, num- 
ber of deaths from leukemia) with that which would have been 
expected in a group of people of the same age and sex distribu- 
tion as the cohort if the age-and-sex-specific general population 
rates had prevailed. In the study of ankylosing spondylitis, and in 
any study in which the study cohort includes all persons with the 
specific exposure in the population from which the general rates 
are derived, this is an acceptable procedure; however, when this 
is not the case, it is necessary to consider whether the manner of 
selection of the cohort might produce differences in disease rates 
that are unrelated to exposure. The likelihood of differences aris- 
ing in this way will vary with the manner in which the cohort is 
assembled and with the extent to which the frequency of the out- 
come being measured varies between subgroups of the popula- 
tion. 
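
The calculation of an expected number from age-and-sex-specific general population rates can be sketched as follows. All figures are hypothetical: the person-years of observation, the population rates and the observed number of deaths are invented for illustration and are not taken from any of the studies cited.

```python
def expected_events(person_years_by_stratum, rates_per_100k):
    """Number of events expected in the cohort if the general population
    rate for each age-sex stratum had prevailed."""
    return sum(
        person_years * rates_per_100k[stratum] / 100_000
        for stratum, person_years in person_years_by_stratum.items()
    )


# Hypothetical person-years of follow-up in the exposed cohort.
cohort_person_years = {
    ("male", "25-44"): 40_000,
    ("male", "45-64"): 30_000,
    ("female", "25-44"): 10_000,
    ("female", "45-64"): 8_000,
}

# Hypothetical general population death rates per 100,000 per year.
population_rates = {
    ("male", "25-44"): 2.0,
    ("male", "45-64"): 5.0,
    ("female", "25-44"): 1.5,
    ("female", "45-64"): 4.0,
}

observed_deaths = 12  # hypothetical
expected_deaths = expected_events(cohort_person_years, population_rates)
ratio = observed_deaths / expected_deaths

print(f"expected deaths: {expected_deaths:.2f}")
print(f"observed deaths: {observed_deaths}")
print(f"ratio of observed to expected: {ratio:.1f}")
```

Because the expected number is built up stratum by stratum, differences in age and sex composition between the cohort and the general population are allowed for, although differences arising from the manner of selection of the cohort are not.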

Comparison with a similar though not exposed cohort might be 
considered a preferable approach when the exposed cohort does 
not represent all the exposed persons in a defined population. 
For example, in industrially exposed persons, persons in other 
occupations in the same industry may be suitable; Simpson and 
her colleagues, as well as deriving expected numbers of cases 
from population rates, compared the frequency of neoplasms in 
children irradiated for thymus enlargement with the frequency of 
neoplasms in their siblings; Seltser and Sartwell (1959) compared 
the mortality of cohorts of radiologists with those of similarly 
defined cohorts of pathologists. In theory, an ideal comparison
cohort for the studies of patients irradiated for ankylosing
spondylitis and thymic enlargement would be patients with ankylosing
spondylitis or thymic enlargement who were not irradiated,
since such comparison would also rule out the possibility that
ankylosing spondylitis or thymic enlargement per se were
responsible for any unusual outcome noted. Such groups, however,
are often hard to come by.

Studies of cohorts identified at birth offer favorable opportuni- 
ties for selection of comparison cohorts because of the ready 
identification of the related population. Thus, cohorts of birth- 
injured children might be compared with cohorts of infants se- 
lected at random or on a paired basis from the same population of 
births (see page 243). 

For purposes of comparison with a specific cohort of exposed 
persons, it may be that neither general population rates nor an- 
other special group may be entirely satisfactory. A study may never- 
theless be of value. In the first place, the study may be dealing 
with persons receiving much larger than average exposures to the 
agent under investigation, and large increases in incidence of the 
disease under investigation may be expected. There may be little
interest in the demonstration of minor increases. Secondly, co- 
horts of persons exposed to certain agents are not frequent and 
all data that can be obtained from them, even if imperfect, may 
be of value. If firm conclusions cannot be drawn from the results 
of one study, the sum of several studies of different cohorts ex- 
posed to the same agent may be convincing. For example, the re- 
sults of the study by Simpson et al. (1955) of children irradiated 
for thymic enlargement are shown in Table 32. Comparison with 
the general population rates can be criticized on the basis of the 
wide variation in incidence of leukemia and thyroid cancer with
regard to other variables (such as geography, ethnic group, and 
economic group) which were not accounted for in the calculation 
of the expected number of cases. Comparison with siblings can 
also be criticized on the grounds discussed in the next chapter 
(page 238). However, the differences noted are much larger than 
those that might reasonably be accounted for on the basis of lack 
of control for these other variables. We may note that the ob- 
served cases of cancer, other than leukemia and thyroid cancer, 


TABLE 32


Frequency of leukemia and thyroid cancer among children treated by
X-ray in infancy for thymic enlargement *


                      Irradiation            No irradiation
                    (1722 children)         (1795 siblings)
Disease            Number    Number        Number    Number
                  observed  expected      observed  expected
Leukemia              7       0.6             0       0.6
Thyroid cancer        6       0.1             0       0.1
Other cancer          4       1.9             5       2.0
All cancer           17       2.6             5       2.7


* From Simpson et al. (1955).


exceed the expected number of cases in both the irradiated 
group and their siblings. This might be interpreted as evidence
of more thorough case-finding among the two cohorts than among 
the general population which furnished the rates for calculation 
of the expected numbers. However, the extent of this possible 
bias, as measured in the “other cancer” group, is by no means 
large enough to explain the relative excess of leukemia and 
thyroid cancer among the irradiated group. In addition, the con- 
sistency of these findings with those of other studies linking in- 
creased cancer risk to exposure to ionizing radiation supports 
the contention that the increased risk experienced by therapeuti- 
cally irradiated infants, at least with respect to thyroid cancer 
and leukemia, is quite probably a real one. 
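
The argument can be made concrete by expressing each entry of Table 32 as a ratio of observed to expected cases. The short Python sketch below does only that; the dictionary layout and function name are choices made for this illustration, while the numbers are those of the table.

```python
# Observed and expected numbers of cases from Table 32 (Simpson et al., 1955).
table_32 = {
    "irradiated": {
        "leukemia": (7, 0.6),
        "thyroid cancer": (6, 0.1),
        "other cancer": (4, 1.9),
    },
    "siblings": {
        "leukemia": (0, 0.6),
        "thyroid cancer": (0, 0.1),
        "other cancer": (5, 2.0),
    },
}


def observed_to_expected(observed, expected):
    """Ratio of observed cases to the number expected from population rates."""
    return observed / expected


ratios = {
    group: {
        disease: observed_to_expected(obs, exp)
        for disease, (obs, exp) in rows.items()
    }
    for group, rows in table_32.items()
}

for group, rows in ratios.items():
    for disease, ratio in rows.items():
        print(f"{group:10s} {disease:15s} observed/expected = {ratio:5.1f}")
```

The ratio for "other cancer" is a little over 2 in both cohorts, which gives a rough measure of the possible case-finding bias. The ratios for leukemia and thyroid cancer in the irradiated group are several times larger than that.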


FOLLOW-UP 


Whether the cohorts to be studied were drawn from a general 
population or consist of a special exposure group and appropriate 
comparison groups, there remains the task of determining the 
outcome among the various groups. In most instances the out- 
come to be ascertained will be appearance of either morbidity or 
death from a certain disease manifestation or group of manifesta- 
tions. It is clear that completeness of such ascertainment should 
not be unequally affected by membership or nonmembership in 
a particular exposure category. This objective is most easily at- 
tained when the method of ascertainment or follow-up depends 
on a mechanism completely separate from that by which exposure
was categorized. Perhaps most satisfactory of all is the use of some 
routine procedure that is undergone by all affected persons re- 
gardless of the fact that they are members of a special study 
group. For example, in Doll and Hill’s and in Dorn’s studies of 
cigarette smoking, data on exposure were obtained from the in- 
dividuals constituting the cohort, but the fact of these individuals 
dying and the causes of death were obtained from routine records. 
In the former study, copies of all death certificates on which there 
was indication that the decedent was a physician were sent to the 
study group; in the latter, claims for insurance benefits were the 
primary sources of information. In both, these primary sources 
were supplemented by inquiry of hospitals or physicians caring 
for the patient prior to death. 

Routine records of death requisite for the termination of a pen- 
sion, those kept by industry, and those kept by treatment centers 
were the respective mechanisms of follow-up for the studies of 
mustard gas pensioners, dyestuffs workers and patients with 
ankylosing spondylitis referred to earlier. For the patients with 
ankylosing spondylitis, the routine records of the treatment cen- 
ters were supplemented by a search of death certificates of the 
general population for matches between the cohort members and 
persons who had died of leukemia or aplastic anemia. Such meth- 
ods are economical since they do not require location and inter- 
view of each individual cohort member. 

To be efficient the routine source of information utilized must 
of course be one that will reveal the great majority of the cases 
of disease or death that actually occur in the cohort. For ex- 
ample, certain of these sources might not provide data on those 
members of the cohort who, because of emigration, change of 
occupation, or other reasons, are no longer under observation 
(even indirect observation). Under these circumstances it may 
be uncertain how many cohort members actually are at risk of 
the outcomes under investigation, and the importance of assessing 
such influences before deciding on this enticingly economical 
method is obvious. 

While satisfactory as a measure of the occurrence or nonoccur- 
rence of such diseases as leukemia and lung cancer, death is often 
too crude an index of outcome. For example, in the Framingham 
study and in the Cerebral Palsy Collaborative Study, indices of
heart disease and of infant defect are sought that frequently can 
only be obtained by specialized clinical examination of living in- 
dividual cohort members. Such studies call therefore for regular 
periodic clinical examinations of each member of the cohort. This
method yields a great deal more information for the individuals 
examined than would the use of any routine records. However, it 
has two disadvantages: 

(1) The proportion of individuals examined decreases with 
the passing years and the membership of the cohort becomes more 
and more self-selected. The extent and nature of such self-selec- 
tion can be estimated by comparing cohort members for whom 
clinical examinations were obtained with those for whom they 
were not, with respect to their demographic characteristics and 
crude criteria of outcome not requiring patient contact, such as 
record of subsequent hospitalization or death. 

(2) There is some danger of exposure class and diagnosis of 
outcome becoming confounded through lack of independency of 
the observations. This risk might be reduced by such means as not 
permitting the diagnostician access to past records or by not per- 
mitting him to delay diagnostic appraisal. But it is difficult for a 
physician to remain ignorant of the antecedents of a patient he is 
examining, or for him to make a diagnosis in the absence of in- 
formation he considers desirable. The elimination of bias from 
this source is extremely difficult. “Blind” readings, the use of 
objective biochemical tests and other measures familiar to the de- 
signers of therapeutic trials should be appropriate, but their 
practicality in this instance has been tested infrequently. 


ANALYSIS 


Basically, analysis of data from cohort studies involves deriva- 
tion of rates of a specified outcome among the cohorts under 
study. These rates can be compared between various exposure 
groups of the same general population cohort or between exposed 
and nonexposed cohorts. Alternatively, the number of cases of a 
disease observed in a specially exposed cohort can be compared 
with the number expected on the basis of general population 
data. The requirement that the groups compared be similar with 
respect to age and other pertinent variables has been stressed in
earlier chapters and is equally applicable here. 


Person-Years of Risk 


A frequently used denominator for calculating rates of the 
specified outcome is person-years of risk. This denominator takes 
simultaneously into consideration the number of persons at risk 
and the duration of risk of each person. For example, if 10 per- 
sons remain in the study for 10 years, there are said to be 100 
person-years of risk. The same figure would be derived if 100 per- 
sons were at risk for 1 year or if 200 persons were at risk for 6 
months. The practical consideration which makes the use of this 
concept a convenience is that a cohort may not retain the same 
strength during the whole of the period for which outcome is be- 
ing measured: 

(1) Entrance dates may vary. For example, veterans answer- 
ing Dorn’s second questionnaire on smoking entered the cohort 
some 3 years after those answering the first. Patients with ankylos- 
ing spondylitis entered the British cohort at various times between 
1935 and 1954. 

(2) During the course of the study certain individuals will 
drop out from the “at risk” category because of death, loss from 
the cohort, or other reasons. 

In addition to changes in strength, changes in the age distribu- 
tion of the cohort will necessarily occur as its members are fol- 
lowed in time. If the changes in age occurred equally in the vari- 
ous exposure groups they might, for purposes of comparison 
between exposure groups, be ignored. However, an assumption 
that all groups will be equally affected by age changes is rarely a 
sound one. Separate calculations of risk which take into account 
differences due to both changing strength and age are nearly al- 
ways indicated. 

Calculation. The data of Doll and Hill (1956), shown in Table 
33, are illustrative of the person-years of risk method. No new
physicians were admitted to the cohort after the beginning date 
(November 1, 1951), and the changes in age distribution are 
therefore due solely to the aging of the cohort and the occurrence 
of deaths. It should perhaps be made clear that the derivation of 
the table did not involve repeated censuses of the cohort, but was 
based on knowledge of the ages of its members on November 1,
1951, and on the data that were being routinely assembled on its 
decedents. 

TABLE 33


Number of men at risk according to age at successive anniversaries of
entering the study *


                               Number of men at risk on
Age (years)                                                            Person-
at specified    Nov. 1,  Nov. 1,  Nov. 1,  Nov. 1,  Nov. 1,  Apr. 1,   years
date             1951     1952     1953     1954     1955     1956     at risk †
Under 35        10,140    9,145    8,232    7,389    6,281    5,779     35,489
35-44            8,886    9,149    9,287    9,414    9,710    9,796     41,211
45-54            7,207    7,257    7,336    7,351    7,215    7,191     32,156
55-64            4,094    4,212    4,375    4,601    5,057    5,243     19,909
65-74            2,604    2,754    2,868    2,873    2,902    2,928     12,462
75-84            1,382    1,433    1,457    1,485    1,483    1,513      6,431
85 and over        181      200      223      256      278      296      1,028
All ages        34,494   34,150   33,778   33,369   32,926   32,746    148,686


* Data from a cohort study of British physicians (Doll and Hill, 1956).
† See text for method of calculation.


The person-years of risk, shown in the last column, were de- 
rived as follows: In the age group under 35 years, there were 
10,140 men alive on November 1, 1951, and 9145 alive on No- 
vember 1, 1952. Therefore, if death occurred evenly through the 
year there were on average 9643 men alive during the first year 
of the study and these contributed 9643 person-years at risk. 
Similarly, during the second, third, and fourth years there were 
an average of 8688, 7811, and 6835 men at risk, respectively. Dur-
ing the last period (5 months) the average number at risk was
6030, giving 6030 × 5/12 = 2512 person-years. The person-years at
risk during the entire period of observation was, therefore,
9643 + 8688 + 7811 + 6835 + 2512, or a total of 35,489 person-years.
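
The arithmetic just described can be checked with a short sketch in Python. The function name and the representation of the intervals in months are choices made for this illustration. The counts are those of the "Under 35" row of Table 33, and deaths are assumed to occur evenly within each interval, as in the text.

```python
def person_years_at_risk(counts, interval_months):
    """Person-years from successive counts of persons at risk.

    Each interval contributes the mean of its opening and closing counts,
    multiplied by the length of the interval in years.
    """
    total = 0.0
    for opening, closing, months in zip(counts, counts[1:], interval_months):
        total += (opening + closing) / 2 * months / 12
    return total


# "Under 35" row of Table 33: Nov. 1 of 1951 to 1955, then Apr. 1, 1956.
under_35 = [10140, 9145, 8232, 7389, 6281, 5779]
interval_months = [12, 12, 12, 12, 5]

print(round(person_years_at_risk(under_35, interval_months)))  # prints 35489
```

The same function applies to any other row of the table, since only the counts change.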

Calculations made separately for exposure groups enabling 
comparisons of outcome are shown in Table 34. The summary 
rate (total) for each exposure group was derived by standardizing 
rates for each age group to the age distribution of the population 
of the United Kingdom in 1951. 
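
Standardization of this kind, usually called direct standardization, amounts to a weighted average of the age-specific rates, the weights being the proportions of the standard population in each age group. The Python sketch below shows the calculation. The age groups, the rates, and the standard population are hypothetical figures invented for the example and are not those of Doll and Hill or of the United Kingdom census.

```python
def directly_standardized_rate(age_specific_rates, standard_population):
    """Weighted mean of age-specific rates, weighted by the standard population."""
    total = sum(standard_population.values())
    return sum(
        rate * standard_population[age] / total
        for age, rate in age_specific_rates.items()
    )


# Hypothetical standard population (numbers of persons in each age group).
standard_population = {"35-54": 600, "55-74": 300, "75 and over": 100}

# Hypothetical age-specific death rates per 1000 per year.
nonsmoker_rates = {"35-54": 0.0, "55-74": 0.2, "75 and over": 0.4}
smoker_rates = {"35-54": 0.5, "55-74": 2.0, "75 and over": 3.0}

nonsmoker_summary = directly_standardized_rate(nonsmoker_rates, standard_population)
smoker_summary = directly_standardized_rate(smoker_rates, standard_population)
print(f"nonsmokers: {nonsmoker_summary:.2f} per 1000")
print(f"smokers:    {smoker_summary:.2f} per 1000")
```

Because both exposure groups are weighted by the same standard population, a difference between the two summary rates cannot be due to a difference in their age distributions.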

Difficulties. The major difficulty in the use of person-years is
that tests of statistical significance cannot be applied to the re-
sulting rates. The individual person-years constituting the de-
nominator do not satisfy the criterion of independence from each
other necessary for the routine tests. For example, 10 person-years 
contributed by 1 individual are likely to demonstrate more sim- 
ilar experiences than the same number of person-years derived 
from the experience of 10 individuals during 1 year. Further- 
more, there is no practical way of determining to what extent 
complete independence is departed from, and consequently no 
way of making an allowance for it. In general, therefore, the 
determination as to whether observed differences based on per- 
son-years might be due to chance fluctuations must largely be de- 
rived from an inspection which considers the size and pattern 
of the differences. 

Cohort studies do not, of course, necessarily require the use of 
person-years at risk in the denominator of rates. For example, the 
use of person-years at risk is not necessary when the period of risk 
is limited, as for example in studies of pregnancy for fetal de- 
fects, when rates can be expressed per person at risk without 
specification of time (see Chapter 5, page 68). 


Persons Lost to Follow-up 


In certain types of cohort study, particularly those in which 
follow-up information is obtained by periodic medical examina- 
tion or other contact, a number of members of the cohort will 
be lost to trace at each follow-up examination. Such patients can 
no longer be considered at risk and some adjustment of the de-
nominator of the rate, whether it is persons or person-years, is
required. The difficulty raised by the treatment of persons lost to 
trace is that the probability of loss may be related to the outcome 
being measured. Thus, it is possible that cohort members who 
develop the disease under study may tend to migrate to some 
other geographic area for treatment, or on the other hand they 
may tend to move less than those not affected. It is also possible 
that the probability of loss is related to the exposure group of the 
person — for example, smokers may be more, or less, “restless” 
than nonsmokers. Since it is extremely difficult to measure the 
effect of these considerations, the “only correct method of han- 
dling persons lost to follow-up is not to have any” (Dorn, 1950). 
However, in spite of all efforts that should be, and usually are, 
made to reduce their number to a minimum, a hard core nearly
always remains.

Ways of assessing the possible influence of losses to follow-up 
on the findings of epidemiologic studies should be considered in 
context with the method of obtaining the follow-up information: 

(1) When follow-up depends on a regularly scheduled ex- 
amination in which a change in status between one examination 
and the next is the outcome being recorded, the most accurate 
method is to assume that persons lost to trace between two ex- 
aminations were lost immediately after the first examination. The 
denominator will then be the number of persons actually ex- 
amined on each occasion (or the number of person-years experi- 
enced between two examinations by persons having the second). 
The reasoning behind this method is that persons lost between 
the two examinations cannot figure in the numerator of the rate 
and thus are inappropriate for inclusion in the denominator. 

(2) When follow-up depends on a certain event occurring 
between two dates and ascertainment of events takes place at the 
time of their occurrence (as, for example, when cohort mem- 
bers are instructed to report certain symptoms) a number of 
methods are possible: 

(a) If the exact date at which the person leaves the co- 
hort is known, an adjustment can be made for the length of time 
he was at risk. 

(b) If it is known only that a person disappeared at some 
time between the two dates, it can be assumed that he was at risk 
for half the period between the two dates.

(c) Two denominators can be calculated — one based on 
the assumption that all persons were lost immediately after the 
last date they were known to be present, and one based on the as- 
sumption that they were all lost immediately prior to the first 
date on which they were known to be absent. The correct rate 
must lie somewhere between the rates based on the two extreme 
and opposite assumptions. 
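
Methods (b) and (c) can be illustrated with a small Python sketch. The cohort size, the number lost, and the number of events are hypothetical, and the function is a simplification written for this example: persons who develop the outcome are counted as at risk for the whole interval.

```python
def rates_under_loss_assumptions(n_start, n_lost, events, interval_years=1.0):
    """Rates per 1000 person-years under three assumptions about time of loss."""
    remaining = n_start - n_lost
    denominators = {
        # Method (c), first extreme: lost right after last being seen.
        "lost at start of interval": remaining * interval_years,
        # Method (b): each lost person counted for half the interval.
        "lost at midpoint": (remaining + n_lost / 2) * interval_years,
        # Method (c), second extreme: lost just before first found absent.
        "lost at end of interval": n_start * interval_years,
    }
    return {
        label: 1000 * events / person_years
        for label, person_years in denominators.items()
    }


# Hypothetical cohort: 1000 persons, 100 lost during one year, 18 events.
rates = rates_under_loss_assumptions(n_start=1000, n_lost=100, events=18)
for label, rate in rates.items():
    print(f"{label:26s} {rate:5.1f} per 1000 person-years")
```

In this example the two extreme assumptions give rates of 20 and 18 per 1000, and the rate from method (b) falls between them.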

By use of one or the other of these methods, it is usually possi- 
ble to make adequate adjustment for the fact that differences in 
the proportion lost to trace may occur between the various ex- 
posure groups. However, the problem of bias resulting from the 
possible relationship between outcome and probability of follow-
up remains. The only solution to this problem is to make two
assumptions, analogous to paragraph (c): first, none of the per- 
sons lost to trace developed the specific outcome, and second, all 
of the persons lost to trace developed the specific outcome. This 
again yields an estimate of the maximal range due to error of 
this type. The method may be useful in studies such as those of 
prognosis in severe diseases in which the proportion lost to trace 
is small and the frequency of specified outcome (usually death) is 
relatively high. Unfortunately it has little applicability to epi- 
demiologic studies in which the frequency of the outcome being 
measured is commonly smaller than the proportion of persons 
lost to trace, because the range between the two estimates will be 
too large to have any practical value. 
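
The contrast between the two kinds of study is easily seen numerically. In the Python sketch below, both sets of figures are hypothetical and were chosen only to represent a prognosis study (few losses, frequent outcome) and an epidemiologic cohort study (more losses, rare outcome).

```python
def outcome_proportion_range(n_start, n_lost, events_observed):
    """Lowest and highest possible proportion with the outcome.

    Lowest: none of the persons lost to trace developed the outcome.
    Highest: all of the persons lost to trace developed the outcome.
    """
    lowest = events_observed / n_start
    highest = (events_observed + n_lost) / n_start
    return lowest, highest


# Hypothetical prognosis study: 200 patients, 4 lost, 120 deaths observed.
prognosis = outcome_proportion_range(n_start=200, n_lost=4, events_observed=120)

# Hypothetical cohort study: 10,000 persons, 500 lost, 20 cases observed.
cohort = outcome_proportion_range(n_start=10_000, n_lost=500, events_observed=20)

print(f"prognosis study: {prognosis[0]:.3f} to {prognosis[1]:.3f}")
print(f"cohort study:    {cohort[0]:.3f} to {cohort[1]:.3f}")
```

In the first case the range is narrow (0.60 to 0.62). In the second the upper limit is 26 times the lower, so the range gives no useful information.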


Relative and Attributable Risk 


In the analysis there are at least two measures of the strength 
of association between exposure to a particular factor and risk of 
a certain outcome. These are: 

(1) Relative risk —the ratio of the incidence of the disease 
among those exposed to the incidence among those not exposed. 
Thus, from the data in Table 34, we could say that the risk of 
lung cancer was twenty-four times as great for heavy smokers as 
for nonsmokers (1.66/0.07). This figure is known as the relative 
risk. 

(2) Attributable risk —the absolute incidence of the dis- 
ease in exposed individuals that can be attributed to the expo- 
sure. This measure is derived by subtracting the incidence of 
the disease in question among nonexposed persons from its total 
incidence among exposed individuals. It is assumed that causes of 
the particular disease other than exposure had equal effect on the 
exposed and nonexposed groups. For example, risks attributable 
to smoking in each smoking category, using the data from Table
34, are shown in Table 35. It is seen that heavy smoking confers a 
risk of death from lung cancer of 1.59 (1.66 minus 0.07) per 
1000 per year; this is 96 per cent of the total risk experienced by 
heavy smokers. 
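
Both measures can be computed directly from the death rates. The Python sketch below uses the annual lung cancer death rates per 1000 for the four smoking categories as they appear in Table 35. The variable and function names are merely those chosen for this illustration.

```python
# Annual lung cancer death rates per 1000 (Doll and Hill, 1956; see Table 35).
death_rate_per_1000 = {
    "Nonsmokers": 0.07,
    "Light smokers": 0.47,
    "Moderate smokers": 0.86,
    "Heavy smokers": 1.66,
}


def risk_measures(exposed_rate, unexposed_rate):
    """Relative risk, attributable risk, and attributable share of exposed rate."""
    relative_risk = exposed_rate / unexposed_rate
    attributable_risk = exposed_rate - unexposed_rate
    attributable_share = attributable_risk / exposed_rate
    return relative_risk, attributable_risk, attributable_share


baseline = death_rate_per_1000["Nonsmokers"]
measures = {
    category: risk_measures(rate, baseline)
    for category, rate in death_rate_per_1000.items()
}

for category, (rr, ar, share) in measures.items():
    print(f"{category:17s} RR = {rr:5.1f}  AR = {ar:4.2f} per 1000  "
          f"share = {100 * share:3.0f}%")
```

For heavy smokers the relative risk is about 23.7 (rounded to twenty-four in the text), the attributable risk is 1.59 per 1000 per year, and the attributable share is 96 per cent.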

Similarly an estimate can be derived of the total impact that a 
specified exposure may have on the community with respect to 
a particular outcome. For example, the lung cancer death rate 


TABLE 35


Death rates from lung cancer attributable to smoking among British
male physicians *


Smoking             Person-years    Annual death     Attributable
category              at risk       rate per 1000    death rate †
Nonsmokers             15,107           0.07             0
Light smokers          38,586           0.47             0.40
Moderate smokers       36,089           0.86             0.79
Heavy smokers          29,410           1.66             1.59
Total                 113,393           0.81             0.74


* Data from Doll and Hill (1956).
† Death rate in the particular smoking category minus the death rate in
nonsmokers.


for nonsmokers (0.07) may be subtracted from the total com- 
munity lung cancer death rate (0.81); the result obtained might 
be termed the attributable community risk of lung cancer. The 
rates for the nonexposed (nonsmokers) and exposed (smokers) 
must obviously be based on a population similar in demographic 
composition to the total community. In the illustrative data 
(Table 35) this was accomplished, at least with respect to age, by 
the age standardization from which the total rates for each ex- 
posure group were derived. The concept of attributable commu- 
nity risk is useful in that it provides an estimate of the maximum 
reduction of a particular disease rate that might be expected if 
the specified exposure were removed. On the basis of the data in 
Table 35, one might expect that 91 per cent of deaths from lung 
cancer (an annual death rate of 0.74 per 1000) could be avoided 
if the factor of smoking were eliminated. 
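
The calculation behind these figures is shown in the brief Python sketch below. The two rates are those given in Table 35. The function name is a label invented for this illustration.

```python
def attributable_community_risk(total_rate, unexposed_rate):
    """Community rate minus the rate among the nonexposed, and its share."""
    excess = total_rate - unexposed_rate
    share_of_total = excess / total_rate
    return excess, share_of_total


# Annual lung cancer death rates per 1000 from Table 35.
total_rate = 0.81      # all physicians in the study
nonsmoker_rate = 0.07  # nonsmokers only

excess, share_of_total = attributable_community_risk(total_rate, nonsmoker_rate)
print(f"attributable community risk = {excess:.2f} per 1000 per year")
print(f"share of total rate         = {100 * share_of_total:.0f} per cent")
```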


INTERPRETATION 


One of the difficult problems in interpreting the findings of co-
hort studies is that measures of relative and attributable risk fre-
quently give different impressions as to the importance of the
same exposure, resulting in uncertainty as to which impression 
is correct. If one is comparing several factors that are etiologically 
significant in the same disease, the same order of importance of 
the factors will be suggested whether relative or attributable risk 
is examined. This is not the case, however, when the significance 
of the same exposure is assessed for several different manifesta-
tional entities. For example, data from Doll and Hill’s study
(shown in Table 36) indicate that the relative risk from heavy 


TABLE 36


Relative and attributable risk of death from selected causes associated
with heavy smoking among British male physicians aged 35 or more *


                              Death rate †   Death rate †
Cause of death                   among       among heavy   Relative   Attributable
                               nonsmokers      smokers       risk     death rate †
Lung cancer                       0.07           1.66        23.7         1.59
Other cancer                      2.04           2.63         1.3         0.59
Other respiratory diseases        0.81           1.41         1.7         0.60
Coronary thrombosis               4.22           5.99         1.4         1.77
Other causes                      6.11           7.19         1.2         1.08
All causes                       13.25          18.84         1.4         5.59


* Data from Doll and Hill (1956).
† Rates are annual death rates per 1000 related population.


smoking is far greater for lung cancer than for deaths from any 
other group of causes shown, and at the same time indicate that 
the attributable death rate is greater for coronary heart disease 
than for lung cancer. Each of these observations contributes to the 
interpretation, as follows: 

(1) Relative risk gives a clearer idea than does attributa- 
ble risk of the extent to which the observation explains the 
etiology of a specified disease. Thus, although the attributable 
death rate from lung cancer is smaller than that from coronary 
heart disease, the former comprises a far larger proportion of the 
total risk of lung cancer (96 per cent) than does the latter of the 
total risk of coronary heart disease (30 per cent). These per-
centages are merely different algebraic expressions for relative
risk. 

(2) Nevertheless, it would appear that heavy smoking causes 
more people to die of coronary disease than of lung cancer. Con- 
sequently, so far as the impact of heavy smoking on a population 
is concerned, the association with coronary heart disease has at 
least as great a public health significance as does that with lung 
cancer. 

(3) Both the above arguments depend on the assumption 
that the increase in death rate associated with heavy smoking is
the result of a causal association. Alternative explanations would 
be that the increase resulted from some systematic bias inherent 
in the study method, or that a statistical association existed in 
fact but resulted from association of both heavy smoking and the 
specified types of mortality with some unidentified third variable. 
The size of the relative risk usually is a better index of the proba- 
bility of causal relationship than the size of the attributable risk. 
Thus a difference of 10 per thousand (attributable risk) noted 
between two groups would be less likely to be an error of meas- 
urement if it occurred between rates of 1 and 11 than between 
rates of 110 and 120. It would take less systematic bias to raise a 
rate from 110 to 120 than to raise it from 1 to 11, just as it is easier 
to make an error of one inch in measuring a mile than in measur- 
ing a foot. Furthermore, the probability of an association between 
two variables resulting from a chance association of both with a 
third variable would appear to decrease as relative risk increases, 
since the relative risk through association with the third varia-
ble must be at least as great as that between the first two. The end
of the line, however, is not reached until the relative risk attains 
the value of infinity. Such a situation in practice can obtain only 
when man, for reasons of utility that are at the same time har- 
monious with other existing knowledge, constructs by fiat an 
experiential disease entity (such as tuberculosis) where the rela- 
tive risk is, perforce, infinity. 
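
The different orderings produced by the two measures, and the statement in paragraph (1) that the percentages are algebraic expressions for relative risk, can both be verified from the death rates in Table 36. The Python sketch below is an illustration only. The share of risk is computed as 1 - 1/RR, which is identical to the attributable rate divided by the rate among heavy smokers.

```python
# Annual death rates per 1000 among nonsmokers and heavy smokers (Table 36).
table_36 = {
    "Lung cancer": (0.07, 1.66),
    "Other cancer": (2.04, 2.63),
    "Other respiratory diseases": (0.81, 1.41),
    "Coronary thrombosis": (4.22, 5.99),
    "Other causes": (6.11, 7.19),
}

measures = {}
for cause, (nonsmokers, heavy_smokers) in table_36.items():
    relative_risk = heavy_smokers / nonsmokers
    measures[cause] = {
        "relative_risk": relative_risk,
        "attributable": heavy_smokers - nonsmokers,
        # Share of the heavy smokers' risk attributable to smoking.
        "share": 1 - 1 / relative_risk,
    }

highest_relative = max(measures, key=lambda c: measures[c]["relative_risk"])
highest_attributable = max(measures, key=lambda c: measures[c]["attributable"])

print("highest relative risk:    ", highest_relative)
print("highest attributable rate:", highest_attributable)
for cause in ("Lung cancer", "Coronary thrombosis"):
    print(f"{cause}: share = {100 * measures[cause]['share']:.0f} per cent")
```

Lung cancer ranks first by relative risk and coronary thrombosis first by attributable death rate, while the shares of 96 and 30 per cent follow from the relative risks alone.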

Both relative and attributable risk may be seen to play differ- 
ent but important roles in the interpretation of the findings of 
cohort studies. Although confusion of these roles is to be avoided, 
there has been no small amount of writing which has pointed to 
the greater attributable risk of smoking for heart disease than for 
lung cancer and has then, after proposing alternate explanations 
of the association with heart disease, implied that similar ex- 
planations might account for the larger relative risk in associa- 
tion with lung cancer. Such exercises seem ludicrous and possibly 
explicable as intentional efforts to cast doubt on an unpopular 
observation. 


14 
Analytic Epidemiology: 
Case History Studies 

The place of case history studies in the strategy of epidemiol-
ogy was discussed in Chapter 4. The term is used to describe
studies that are based on the investigation of the past histories of 
individuals affected with the disease under study. Commonly a 
specific hypothesis is being tested, for example, that a relation- 
ship exists between smoking and lung cancer, or between mater- 
nal rubella and congenital malformation. Sometimes, in the ab- 
sence of leads pointing to specific causal factors, this type of study 
is used in an exploratory way to investigate the total environ- 
mental background of affected persons (to the extent that this 
can be defined and recalled). As pointed out in Chapter 4, an 
important component of case history studies is the concurrent 
investigation of the past histories of a group of individuals who 
are not affected, to obtain estimates of the frequency of the 
suspected causes with which their frequency in affected individ- 
uals may be compared. Therefore, in the planning and conduct- 
ing of case history studies, the following must be considered: (1) 
selection of cases, (2) selection of comparison groups, (3) 
sources of information, (4) analysis, and (5) interpretation. 


SELECTION OF CASES


Two procedures are common in the assembly of a series of pa- 
tients for a case history study: 

(1) The inclusion of all cases of the disease seen between 
specified dates at some medical care facility or in the practice of 
a certain physician. 

(2) The attempt to include all cases of the disease existing or
occurring between specified dates in a limited and defined popu- 
lation, as, for example, in the population of a county, city, or 
other administrative area. 

The first procedure is the more common, since it is relatively 
easy and inexpensive to carry out. Although the second procedure 
is considerably more laborious than the first, since it involves 
special efforts to locate and obtain the necessary data from all 
affected individuals in the chosen population, it is generally more 
satisfactory because it avoids the bias arising from the selective 
factors that lead affected individuals to particular hospitals or 
physicians, and because it simplifies the selection of a suitable 
comparison sample. 

When a defined population is served by a single medical fa- 
cility, it may be possible to obtain the advantages of both these 
methods. For example, in the investigation of congenital diseases 
or of disorders of pregnancy and parturition, the reference popu- 
lation will be a certain number of pregnancies or delivered births, 
and the births occurring in a certain hospital or combination of 
hospitals may constitute the study population. The cases then, 
although limited to selected medical facilities, will at the same 
time be drawn from a defined population. Other situations in 
which a defined population is served by a single medical facility 
include “closed” populations such as those of schools, service units
and work places. 

It is preferable that cases included in the series be limited to 
cases newly diagnosed within the specified study period. Although 
the inclusion of all cases existing in the period may, in a chronic 
disease, greatly increase the number of cases available for study, 
the inclusion of cases at a variety of stages in the disease process 
may detract from the quality of the histories obtained and compli-
cate the elucidation of time relationships. In patients in advanced
stages of the disease it may be difficult to differentiate events caus- 
ally related to the disease from those consequent to it (as, for 
example, in a study of the X-ray histories of patients with chronic 
leukemia). The advantage of restricting the series to new cases
is particularly clear when study cases can be related to a defined 
population, since this will allow estimates of incidence, rather 
than of prevalence, in relation to the suspected cause. 

The establishment of manifestational diagnostic criteria for in- 
clusion and exclusion of potential study cases is an essential part 
of the design of case history studies. Often it appears wise to in- 
clude manifestational criteria that embrace possible fringes of the 
disease “entity” under consideration. This decision is peculiar to 
each investigation and few generalizations are possible. On the 
whole, those decisions which lean toward clear and reproducible 
applications of definition are probably preferable. If time and 
funds are available, establishment of a number of groups of cases 
based on variously defined criteria is a happy compromise, pro- 
viding the possibility of separate examination of each group of 
criteria. If strong and independent relationships are found, such 
groupings may assist in judging whether or not certain clinical 
syndromes should be regarded as “part” of the disease under 
investigation. 


SELECTION OF COMPARISON GROUPS 


A comparison group is a group of unaffected individuals be- 
lieved to reflect the characteristics of the population from which 
the affected group was drawn. Ideally the comparison group 
should not differ from the affected group in any respect (other 
than not being affected) which might be likely to influence the 
frequency of the variable or variables suspected of being causally 
connected. This means either that both the patient and compari- 
son groups must be representative of the same population or that 
if selective factors enter into the choice of the patients, the same 
factors ought to enter into the selection of the comparison group. 
Two problems will be considered: 

(1) The choice of a large group or sampling frame from 
which to select the comparison group. 


(2) The selection of individual comparison group members 
from the chosen sampling frame. 


The Sampling Frame 


The sampling frame for the comparison group usually is de- 
termined by the origin of the series of patients. If the patients 
represent all the affected individuals in a defined population, 
then the comparison group should also be drawn from that popu- 
lation. On the other hand, if the patients are selected by virtue of 
attending a certain hospital or physician, a sample of the general 
population is not likely to be suitable, since factors that lead 
patients to certain hospitals (such as economic status, race, religion, 
area of residence, etc.) may also be related to the frequency of fac- 
tors suspected of causing the disease. 

When the origin of the series of patients is not the general 
population, it appears necessary to find a comparison group that 
shares as far as possible those selective processes by which the 
patients came to attention. Three common sources of such groups 
are discussed in this section, but these by no means exhaust the 
possible sources of comparison groups. Ingenuity in the selec- 
tion of new bases for comparison is an important part of epidemio- 
logic study design. 

Patients attending the same hospital for other diseases. The 
rationale behind the choice of this comparison group is that it also 
is subject to those selective factors that influenced the patients in 
the study series to come to the particular hospital. For example, 
if the patients to be used in the study are taken from a hospital 
located in an area inhabited by a particular ethnic group, a com- 
parison group drawn from other patients attending the same hos- 
pital is likely to have the same ethnic distribution as the study 
group. 

It is assumed that the factors that lead a patient to come to a 
certain hospital are the same for the disease under investigation 
as for other diseases. This assumption is usually a reasonable one, 
although not invariably so. For example, if a certain hospital has 
a wide reputation for the treatment of a specific disease, it might 
be dangerous to compare its patients with this disease with its 
general patients, since the patients with the specific disease are 
likely to be drawn from a wider geographic area and from a 
generally higher economic group than those with other diseases. 
A related point is that, since there may be demographic differ- 
ences between the patients of different services even within the 
same hospital, the source of selection of the comparison group 
within the hospital should be as similar as possible to the source 
of the study series. 

The question will arise whether a comparison group should be 
drawn from all patients attending the hospital or only from pa- 
tients with certain groups of diseases that are believed not to be 
influenced by the factors being studied. For example, if the his- 
tories of the smoking habits of a series of patients with lung can- 
cer have been assembled, should these be compared with histories 
of a sample of all patients in the same hospital or with the his- 
tories of a particular diagnostic group, for example, accident 
cases? The appeal of the second course is that if there are diseases 
other than lung cancer that are influenced by smoking, the hospi- 
tal patients will not be representative of the healthy population 
with respect to smoking, and a difference between lung cancer pa- 
tients and other patients may reflect the effect of smoking on dis- 
eases other than lung cancer. On the other hand, if a diagnostic 
group can be identified that is known not to be influenced by 
smoking, the comparison group can be assumed to represent the 
general population, and a difference between the lung cancer 
patients and the comparison group can theoretically be presumed 
to reflect an influence of smoking on lung cancer. 

The practical disadvantage of this course is the difficulty of 
identifying a diagnostic group that is known to be representative 
of the general population with respect to any particular factor. 
Consequently, if a difference between the study series and a com- 
parison group drawn from a single diagnostic category is found, it 
is difficult to decide with confidence which of the compared 
groups differs from the general population. 

On the whole, a comparison group drawn from all patients is 
probably preferable to one drawn from a single diagnostic cate- 
gory. Since the interpretation of differences between patients in 
the study group and comparison patients depends to a large extent 
on the “reasonableness” of different explanations, it is common to 
use several comparison groups from different sources and to examine 
the patterns that appear. For example, patients with cancer 
of the lung may be compared with patients with cancer of 
other sites, patients with diseases other than cancer, and patients 
without known disease who have come to the hospital for diagnos- 
tic procedures. If, in such a study, the lung cancer group differs 
markedly from all the other groups, the evidence is strengthened 
that it is the lung cancer group whose experience with the sus- 
pected factor is unusual. Caution is necessary in the use as a 
comparison group of patients seen at the hospital in whom no dis- 
ease is diagnosed. Probably this is the most unrepresentative of 
any group of patients seen at hospitals. 

Relatives of the study group. The advantages of using relatives 
as a comparison group are the group’s close similarities in ethnic 
and social background to the study group. Two types of relative 
are used most commonly — spouses and siblings. The use of the 
spouse may raise difficulties if there is any possibility of a sex 
difference in the frequency of the manifestation or the suspected 
cause. It is of course possible to make like-sex comparisons (for ex- 
ample, of male patients with the husbands of female patients), but 
such comparisons may be impossible or at least inefficient if there 
is an appreciable excess of patients of one sex in the study group. 

When siblings are used as the source of a comparison group, 
it is necessary to select from among the available siblings on a 
one patient: one sibling basis. The mistake is occasionally made of 
comparing a series of patients with all their available siblings. 
This procedure leads to bias in that the comparison group will be 
heavily weighted with the members of large families and will 
show to an excessive degree (by comparison with the study 
group) any of the multitude of characteristics that are related to 
family size. For the same reason, patients for whom no compari- 
son sibling is available must be excluded; otherwise the study 
group, in contrast to the comparison group, will be weighted 
with one-child families. 
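
The one patient: one sibling rule can be sketched in a few lines of 
Python. This is only an illustration under stated assumptions: the 
data layout (a mapping from patient identifiers to lists of available 
siblings), the function name, and the choice of picking the sibling 
at random are all hypothetical and are not taken from any study 
described here. 

```python
import random

def pair_patients_with_siblings(siblings_by_patient, seed=0):
    """Return {patient: one chosen sibling}; drop patients with no sibling.

    siblings_by_patient is a hypothetical mapping from a patient
    identifier to the list of that patient's available siblings.
    """
    rng = random.Random(seed)  # fixed seed so the selection is reproducible
    pairs = {}
    for patient, siblings in siblings_by_patient.items():
        if not siblings:
            # No comparison sibling: the patient is excluded too, so that
            # the study group is not weighted with one-child families.
            continue
        # Exactly one sibling per patient, however large the family.
        pairs[patient] = rng.choice(siblings)
    return pairs

# Hypothetical families, for illustration only.
families = {
    "P1": ["S1a", "S1b", "S1c", "S1d"],  # large family
    "P2": ["S2a"],
    "P3": [],                            # no sibling: excluded
}
pairs = pair_patients_with_siblings(families)
```

Because each family contributes one patient and one sibling, a large 
family carries no more weight in the comparison group than it does in 
the study group. 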

The major disadvantage of using relatives for comparison pur- 
poses is that their number is limited, and it may therefore not be 
possible to select a group that is close to the study group in some 
other important characteristics such as age (see the section on 
matching, page 244). Since many patients may not have any relatives 
suitable for inclusion in a comparison group, such patients 
will have to be excluded, thus removing a proportion of the case 
material from consideration. 

Associates of the study group. Members of the general popula- 
tion may be used on the basis of sharing some common experience 
with the members of the study group, for example, attendance at 
the same school, similar place of employment, or residence in 
the same neighborhood. This type of comparison group combines 
some of the advantages of the first two types, being comprised for 
the most part of healthy individuals and at the same time provid- 
ing a large pool from which individuals with specific characteris- 
tics can be selected if necessary. The major disadvantage of the 
method is the relatively laborious field work that is involved in 
identification of comparison individuals. In addition, if the in- 
formation sought is greatly detailed or is of a highly personal na- 
ture, an investigator may be reluctant to attempt to obtain such 
data on a group to which he does not have the entree that is pro- 
vided by the methods of selection of the first two types of com- 
parison groups. However, there is no doubt that if facilities are 
available a competently selected group of this type is extremely 
satisfactory. 


Selection of Individual Comparison Group Members 


Having selected the source from which the comparison group 
will be drawn, it is necessary to decide whether the data for the 
patients in the study will be compared with data on all the in- 
dividuals who might be available from the chosen source, or 
whether a sample will be selected. If a sample is decided on, a 
choice must be made between the various sampling procedures 
available, the most common of which are random, systematic and 
paired sampling. It is said that the selection of a source of a com- 
parison group is more important than the method of sampling 
individuals from that source (Cochran, 1951). This may be true; 
nevertheless, however satisfactory its source, the usefulness of a 
comparison group can be seriously affected by unsatisfactory 
sampling. 

Situations in which sampling is not involved. When the general 
population from which the patients were drawn can be defined 
and is chosen as the source of a comparison group, it may be possible 
to use data on the total population. Clearly this is only 
practical when data on the frequency of the factors under ex- 
amination are recorded routinely for the total population. 

One of the situations in which total population data commonly 
form the basis for comparison is in studies of familial occurrence 
— to answer the question of whether death from the same disease 
is more common among blood relatives of affected persons than 
among the general population. Mortality rates observed in the 
general population are applied to the group of relatives to de- 
termine an expected number of deaths which is then compared 
with the actual number observed. It is of course necessary to con- 
sider the number of person-years that the group of relatives has 
been at risk in each age period, and to apply the age-specific rates 
in the general population. Depending on the demographic charac- 
teristics of the disease being examined, it may also be necessary 
to apply rates that are specific for other variables, for example, 
sex and race, and if the disease has shown appreciable secular 
change, even for the period during which the group was at risk. 
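
The calculation of an expected number of deaths can be sketched as 
follows. The age groups, person-years, rates, and observed count below 
are invented for illustration, and the function name is hypothetical; 
a real analysis might also need rates specific for sex, race, and 
calendar period, as noted above. 

```python
def expected_deaths(person_years, rates_per_1000):
    """Sum, over age groups, of person-years at risk times the
    age-specific death rate of the general population."""
    return sum(
        person_years[age] * rates_per_1000[age] / 1000.0
        for age in person_years
    )

# Hypothetical figures, for illustration only.
person_years = {"40-49": 2000, "50-59": 1500, "60-69": 1000}
rates_per_1000 = {"40-49": 0.5, "50-59": 1.0, "60-69": 2.0}

expected = expected_deaths(person_years, rates_per_1000)  # 1.0 + 1.5 + 2.0
observed = 9  # hypothetical number of deaths found among the relatives
ratio = observed / expected
```

With these invented figures the expected number is 4.5 and the ratio 
of observed to expected deaths is 2.0. 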

In comparing observed and expected numbers of deaths in such 
a study, it is necessary that the basis of ascertainment be the same. 
Thus, if the general population rates are based on general mortal- 
ity statistics, the observed number of deaths must include only 
those in which the death certificate ascribed the death to this 
particular disease in such a way that the death would have been 
so coded in deriving mortality statistics. Additional cases of the 
disease may of course be uncovered among relatives by virtue of 
the special investigations undertaken, but they cannot validly be 
included in a comparison with general population mortality data. 

Another common use of population comparisons is the study of 
maternity hospital data when all births in selected facilities com- 
prise the population at risk. It is becoming increasingly common 
to process, by punch-card methods, data on characteristics of all 
births occurring in certain hospitals. Such data may provide val- 
uable material for comparison with data on a series of abnormal 
pregnancies or births with respect to order of birth, maternal age, 
blood group, abnormalities of prior pregnancies, and so on. The 
number of variables that can be examined is again limited by the 
amount of information that is collected routinely. 

In general, medical histories taken routinely in hospitals are 
not completed with sufficient consistency to be utilized for study 
of the types of variables usually examined in epidemiologic case 
history studies. Therefore, except in the maternity hospital situa- 
tion, some form of sampling is usually required when hospital pa- 
tients form the comparison group. 

Random and systematic sampling. When collection of data on 
the frequency of factors under investigation requires extensive 
record search or inquiry of the patient, it is usual to attempt to 
assemble such data only for a sample of the comparison group 
members, rather than for all possible comparison individuals. 
Random and systematic samples are selected in such a way that 
the probability of inclusion is the same for each individual in the 
sampled frame. Such sample groups are therefore similar in com- 
position to the total group from which they were selected. They 
are used primarily when a sample of a defined population is re- 
quired for comparison with a series of cases from the same popu- 
lation. Such a sample could also be drawn from certain of the 
other sources of comparison groups discussed (for example, other 
patients attending the same institutions), but when samples of 
such groups are required the method of pairing is usually pre- 
ferred (see page 243). 

A random sample is one in which at each draw each member of 
the total group has an equal chance of being selected; for ex- 
ample, if each individual is numbered and the numbers selected 
for study are picked out of a hat or from a table of random num- 
bers. A systematic sample is one in which the group to be sampled 
is placed in some sort of order and then the selections are made 
systematically throughout the series, for example, of every sec- 
ond, hundredth, or thousandth individual. Provided the order in 
which the group is placed prior to systematic sampling is not 
highly structured with respect to some variable important to the 
study, the characteristics of a systematic sample are similar to those 
of a random sample and similar tests of statistical significance may 
be applied. The distinction between these two methods is there- 
fore not of great importance in most situations, and the choice of 
one or the other method depends primarily on the practicability 
of each in a given situation. 
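
The two procedures can be sketched in Python as follows. This is a 
minimal illustration only: the numbered frame of 1,000 individuals is 
hypothetical, and the use of a random starting point for the 
systematic sample is an assumption of the sketch rather than a 
requirement stated in the text. 

```python
import random

def simple_random_sample(frame, n, rng):
    """Draw n members without replacement, each equally likely."""
    return rng.sample(frame, n)

def systematic_sample(frame, interval, rng):
    """Take every interval-th member of the ordered frame,
    beginning at a randomly chosen starting position."""
    start = rng.randrange(interval)
    return frame[start::interval]

rng = random.Random(1)        # fixed seed so the draw is reproducible
frame = list(range(1, 1001))  # hypothetical numbered list of individuals

random_sample = simple_random_sample(frame, 100, rng)
every_tenth = systematic_sample(frame, 10, rng)
```

Both samples contain 100 individuals. The systematic sample is the 
easier to draw from a card file or register, but it inherits any 
periodic structure in the order of the list. 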

The great advantage of random and systematic samples is that, 
within certain limits of confidence that can be specified statistically, 
they allow generalizations about the total group from which 
they are drawn. As will be pointed out (see page 260), this is 
particularly valuable when the sampled group is the population 
from which the patients were drawn. The same confidence cannot 
be placed in a sample that is selected haphazardly. Bias resulting 
from haphazard selection is frequent and has been discussed by 
many writers (White, 1953; Mainland, 1954). For example, hap- 
hazard selection from lists of names may be influenced by the 
length of the name (related to race), its position on the page, or 
the legibility of the handwriting (possibly related to hospital 
service and hence to diagnosis). Haphazard selection from files 
of cards or records may select those which are thicker or more 
worn and hence cases of greater medical interest. 

A special situation that may arise is the necessity to sample 
from a sampling frame in which certain individuals are listed 
more than once; for example, it may be necessary to sample from 
lists of admissions to hospitals, and certain individuals may have 
been admitted on several occasions. In such a situation a random 
sample of names from the list will be weighted by persons with 
multiple admissions. It will be a random sample of admissions, 
but not of individuals admitted. Moreover, the conversion of 
such a sample to a sample of individuals admitted requires more 
than the checking of the selected sample to see that no individual 
appears more than once in it. It is also necessary to know the dis- 
tribution of individuals in the sampling frame according to fre- 
quency of appearance in the frame, and to adjust the selected sam- 
ple by making allowance for the probability of the various groups 
being selected. 

Before applying such a correction, it is necessary to be sure that 
a sample of individuals is required. For example, among a list of 
births occurring in a hospital over a number of years, certain 
mothers will appear more than once, according to the number of 
times they have given birth in that hospital during the period. It 
is necessary to decide whether a sample of mothers is required 
(in which case the correction described above must be applied), 
or whether a sample of births is more appropriate (in which case 
the correction is not applied). When the series of patients with 
which the sample is to be compared comprises pregnancies or infants 
with a certain disease or characteristic, the probability of a 
particular woman being included in the patient series is usually 
related to the number of births she has contributed to the study 
population. Consequently, if a sample of women is selected from 
the comparison population, it would, by comparison with the pa- 
tient series, be biased by the inclusion of women of low fertility. 
In such a situation, therefore, a sample of births is the more ap- 
propriate comparison group. If, on the other hand, the characteris- 
tic under study is a characteristic of the women themselves, it 
would be more appropriate to select a comparison sample repre- 
senting women. 
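
One way of making allowance for multiple listings, when a sample of 
individuals rather than of listings is wanted, is to weight each 
sampled listing inversely to the number of times the person appears 
in the frame. The Python sketch below illustrates this under stated 
assumptions: inverse weighting is one possible form of the adjustment 
and is not necessarily the procedure the text has in mind, the names 
and data are hypothetical, and for simplicity the whole frame stands 
in for the sample. 

```python
from collections import Counter

def weighted_proportion(sampled_listings, listings_per_person, has_trait):
    """Estimate the proportion of individuals with a trait from a
    sample of listings, weighting each listing by 1/k, where k is the
    number of times that person appears in the whole frame."""
    total_weight = 0.0
    trait_weight = 0.0
    for person in sampled_listings:
        weight = 1.0 / listings_per_person[person]
        total_weight += weight
        if has_trait[person]:
            trait_weight += weight
    return trait_weight / total_weight

# Hypothetical admission list: person A was admitted three times.
admissions = ["A", "A", "A", "B", "C", "D"]
listings_per_person = Counter(admissions)
has_trait = {"A": True, "B": False, "C": False, "D": True}

# Proportion among admissions (4 of 6) versus among individuals (2 of 4).
crude = sum(has_trait[p] for p in admissions) / len(admissions)
adjusted = weighted_proportion(admissions, listings_per_person, has_trait)
```

The unweighted figure describes admissions; the weighted figure 
describes persons admitted. Which of the two is wanted depends on the 
considerations just discussed. 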

Paired samples. For the selection of a random sample, the total 
population to be sampled must be defined and, at least in theory, 
enumerated. This is not always possible. For example, if the comparison 
sample is to be drawn from persons living in the same 
neighborhoods as the patients, how far do these neighborhoods 
extend? Similarly, if the sample is to be drawn from other pa- 
tients attending the same hospital as those comprising the study 
series, it may not be desirable to wait until the total number of 
such patients is known before beginning to assemble the sample. 
In these situations, paired sampling is usually undertaken. This 
involves the selection from the sampling frame of one or more individuals 
for each patient admitted to the study series. Individuals 
are selected by virtue of some defined temporal or geographic 
relationship to a patient, for example, the next patient (or next 
n patients) admitted after a study patient, the person 
living in the nearest residence to that of a study patient, the 
student next to a patient in the alphabetical class listing, and so 
on. 

Once again it is important that the selection is not haphazard. 
The method of selection must be rigidly defined and adhered to. 
For example, in the selection of a neighborhood paired sample 
it will be necessary to specify whether the interviewer goes to the 
right or the left of the patient’s house, what the procedure is in 
the case of two-family and apartment houses, and so on. The fact 
that nobody is at home at the time the interviewer calls is not an 
adequate reason for rejection of the selected individual, since the 
probability of all members of a family being out at any one moment 
is highly correlated with certain demographic characteristics, 
for example, family size. Whatever the method of selection, 
effort must be made to contact selected individuals, the use of 
“alternate” selectees always raising the question of whether the 
omission of the difficult-to-locate group makes the sample unrep- 
resentative. 

Matching and stratification. In any disease, a series of affected 
individuals will differ from the population from which it was 
drawn in many demographic characteristics. It is likely that some 
of these characteristics are related to the frequency of the sus- 
pected causal factors. A difference between the patients and a 
randomly or systematically selected comparison group in fre- 
quency of such factors can therefore be expected, even if there is 
no causal relationship between the suspected factor and the dis- 
ease under study. For example, a series of patients with lung 
cancer will include proportionately more males than will a sam- 
ple drawn from the general run of hospital patients, and will 
therefore contain a higher percentage of heavy smokers than the 
general patient series. (In this particular example, the high pro- 
portion of males in the lung cancer series is due to the fact that 
lung cancer affects heavy smokers and consequently males, rather 
than affecting males primarily and heavy smokers consequently. 
However, this sequence of events will not be known prior to the 
study, and it is necessary to examine the possibility that the predominance 
of males is due to some other cause, independent of 
smoking, and that the high proportion of heavy smokers is secondary 
to the excess of males.) 

There are two general ways of dealing with this type of difficulty: 

(1) Statistical techniques used in the analysis of the data 
that depend essentially on the comparison of like with like — in 
this example, of males in the patient series with males in the 
comparison group (see pages 255-259). 

(2) Sampling techniques applied to the selection of the com- 
parison group that result in a sample that is more similar to the 
patient series than would be a random sample. In the present ex- 
ample, a comparison sample could be selected that had the 
same proportion of males as the patient series; any tendency of 
males to smoke more than females should then affect both series 
equally. These techniques are commonly called stratification (when 
random or systematic samples are being chosen) and matching 
(when paired samples are being selected). 
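
The comparison of like with like can be illustrated with a small 
numerical sketch in Python. All counts are invented: they are chosen 
so that, within each sex, patients and comparison individuals smoke 
heavily with equal frequency, and the only difference between the 
groups is the proportion of males. The names are hypothetical. 

```python
# counts[(group, sex)] = (heavy smokers, total persons); invented figures.
counts = {
    ("patients", "male"): (54, 90),
    ("patients", "female"): (2, 10),
    ("comparison", "male"): (30, 50),
    ("comparison", "female"): (10, 50),
}

def crude_proportion(group):
    """Proportion of heavy smokers in a group, ignoring sex."""
    smokers = sum(s for (g, _), (s, n) in counts.items() if g == group)
    total = sum(n for (g, _), (s, n) in counts.items() if g == group)
    return smokers / total

def sex_specific_proportion(group, sex):
    """Proportion of heavy smokers among one sex within a group."""
    smokers, total = counts[(group, sex)]
    return smokers / total

crude_patients = crude_proportion("patients")      # 56 of 100
crude_comparison = crude_proportion("comparison")  # 40 of 100
```

The crude proportions differ (0.56 against 0.40), yet the sex-specific 
proportions are identical in the two groups (0.6 for males, 0.2 for 
females). In this invented example the crude difference is wholly 
attributable to the excess of males in the patient series. 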

The method of stratification requires knowledge of the dis- 
tribution of both the patient series and the population from 
which the comparison group is to be selected according to the 
variables that are going to be equalized. The comparison popu- 
lation to be sampled is divided into subgroups according to the 
chosen variables, and different proportions are selected from each 
subgroup. For example, if the patient series contains four times as 
many males as females, four times as many individuals should 
be selected from the male subgroups of the comparison population 
as from the female subgroups. 
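
A stratified selection of this kind might be sketched in Python as 
follows. The sketch assumes that the comparison population has already 
been divided into subgroups and that the composition of the patient 
series is known; the population, the counts, and the function name 
are hypothetical. 

```python
import random

def stratified_sample(population_by_stratum, patients_per_stratum, ratio, rng):
    """From each stratum, draw `ratio` comparison individuals for every
    patient in that stratum, so the sample mirrors the patient series."""
    sample = {}
    for stratum, n_patients in patients_per_stratum.items():
        n_wanted = n_patients * ratio
        sample[stratum] = rng.sample(population_by_stratum[stratum], n_wanted)
    return sample

rng = random.Random(2)  # fixed seed so the draw is reproducible

# Hypothetical comparison population, already divided by sex.
population_by_stratum = {
    "male": [f"M{i}" for i in range(500)],
    "female": [f"F{i}" for i in range(500)],
}
# Hypothetical patient series with four times as many males as females.
patients_per_stratum = {"male": 80, "female": 20}

sample = stratified_sample(population_by_stratum, patients_per_stratum, 1, rng)
```

The resulting sample contains 80 males and 20 females, the same 
four-to-one ratio as the patient series, although the population from 
which it was drawn is equally divided between the sexes. 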

The use of stratification in case history studies has some con- 
siderable disadvantages. In the first place, either because of lack 
of information or because of mechanical difficulties, it may be 
impossible or very difficult to subdivide the population to be 
sampled. Secondly, the necessity to know the final composition of 
the patient series means that a comparison sample cannot be as- 
sembled until after the patient series is complete. This may 
mean that all the comparison individuals would be interviewed 
after all the members of the patient series, which would clearly 
be undesirable. For these reasons, stratification is rarely used in 
case history studies, although it is common in cohort studies. 

When it is decided to select a comparison sample with some of 
the characteristics of the patient series, it is usually preferable to 
use the method of paired sampling as previously described (page 
243), with the additional requirement that the member of the 
pair selected from the comparison population must match the 
corresponding member of the patient series with respect to specified 
criteria. The selected individual would then be, for example, 
not the next person admitted but the next person admitted 
who satisfied the defined criteria. 
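
The rule of taking the next admitted person who satisfies the criteria 
can be sketched in Python. The matching criteria used here (same sex, 
age within 5 years), the record layout, and the function name are 
assumptions made for illustration. The sketch also simplifies matters 
by assuming that the register excludes study patients and by not 
preventing the same person from being chosen for two patients. 

```python
def next_matching_admission(patient, admissions, age_tolerance=5):
    """Return the first person admitted after the patient who matches on
    sex and on age within age_tolerance years, or None if there is none.

    `admissions` must already be sorted by order of admission.
    """
    for person in admissions:
        if (person["order"] > patient["order"]
                and person["sex"] == patient["sex"]
                and abs(person["age"] - patient["age"]) <= age_tolerance):
            return person
    return None

# Hypothetical admission register, sorted by order of admission.
admissions = [
    {"order": 11, "sex": "F", "age": 60},
    {"order": 12, "sex": "M", "age": 40},
    {"order": 13, "sex": "M", "age": 63},
    {"order": 14, "sex": "M", "age": 58},
]
patient = {"order": 10, "sex": "M", "age": 61}

match = next_matching_admission(patient, admissions)
```

Here the persons admitted eleventh and twelfth are passed over (wrong 
sex, and age outside the limit, respectively) and the thirteenth is 
selected. Because the rule is fixed in advance, the interviewer has no 
discretion in the choice. 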

The decision to select a matched sample or to leave the allow- 
ance for differences between patients and comparison group to 
the analysis depends on a number of factors. In considerable 
part, the decision depends on what is already known about the 
characteristic under investigation. Thus, Record and McKeown 
(1949) in a study of congenital malformations of the central nervous 
system compared a series of affected individuals with a systematic 
sample of the related population of births, since little 
was then known about the association of these disorders with de- 
scriptive variables such as maternal age, birth order, season and 
so on. In future studies of more direct causal influences it might 
be desirable in the selection of comparison groups to match for 
some of those variables noted by Record and McKeown and later 
workers to be associated with these malformations — race, birth 
order, calendar time, season. 

Another factor is the detail of the information sought and the 
expense of its collection; adequate analysis will require much 
larger numbers of persons if the comparison sample is unmatched 
than if it is matched; therefore, if there is any advantage in re- 
ducing the number of persons for whom data must be obtained 
a matched sample is preferable. 

Lastly, certain types of comparison groups lend themselves particularly 
to matching. These include hospital patients, relatives, 
and neighborhood groups. Since such groups are not representative 
of any general population, the usual disadvantage of matched 
groups (that they are not representative of a population and 
cannot therefore be used as the basis for incidence estimates; see 
page 260) is of less significance. 

Having decided to select a matched sample, it is necessary to 
specify the variables that will be matched, and the limits of variation 
of each variable that will be considered as matching. For example, 
must age match within 1 year or 20 years? The criteria 
may be relatively simple. For example, in one case history study 
of the role of smoking in lung cancer, in which the comparison 
group was chosen from other patients in the same hospital, the 
sample was matched by sex and by age within 5 years (Doll and 
Hill, 1952). On the other hand, quite detailed matching criteria 
may be set up, as in a current study of the role of X-rays in child- 
hood cancer in which the comparison children, chosen from the 
neighborhood residents, are required to match for age within 1 
year, race, sex, birth order and family size (National Cancer In- 
stitute, 1960). 

The choice of variables to be matched depends on much the 
same considerations as enter into the decision to select a matched 
sample. Since no examination can be made of the possible etiologic 
influence of variables that have been matched, clear knowledge 
of the role of such variables should be at hand from previous 
work. Also to be considered is the practicability of the matching 
process required by the criteria that are set up. The establish- 
ment of criteria that are difficult to satisfy leads to loss of patient 
material (since cases that cannot be matched must be excluded), 
and perhaps to inaccuracies in the carrying out of the matching 
process. Three variables that influence the incidence of all dis- 
eases and to which a wide variety of historical events are related 
are age, sex and race. If matching is attempted, it should usually 
include these variables; beyond these, the matching process be- 
comes more difficult with each additional variable, and each vari- 
able added to the criteria requires justification. The criteria for 
considering two individuals “matched” with respect to any se- 
lected variable are determined almost exclusively by practicabil- 
ity. It is desirable to specify limits that are as narrow as possible 
and at the same time are consistent with one’s knowledge of the 
availability of sufficient individuals to match with the patient 
series. 

One other type of matching should be mentioned. This type oc- 
curs when it is necessary to select one individual from a limited 
number of available persons — for example, when siblings con- 
stitute the comparison group. In such a case the selection of one 
individual from a number of possibilities may be made in terms 
of an important variable. For example, if a patient has several 
siblings, the one nearest in age to the patient might be selected, 
but there would be no specification that the patient and the se- 
lected sibling must match within so many years of age. While 
this process tends to reduce dissimilarities between the groups, 
the patients and comparison relatives will not be completely 
matched, and it may be necessary to analyze the possible effects 
of these differences in the production of any associations noted. 


SOURCES OF INFORMATION 


The common sources of information regarding the factors 
whose influence is being examined are interviews with patients 
or with relatives, hospital records, and vital records such as 
certificates of birth and death. 
All these sources of information are open to criticism on one 
ground or another, and it is fashionable to raise questions concerning 
the validity of case history studies in general on this account. 
However, demand for “absolute accuracy” is quite as revealing 
of a lack of scientific judgment as is intuitive willingness 
to accept poorly based evidence. Random occurrence of inac- 
curacy or incompleteness of data should not produce significant 
differences between study and comparison groups in a case history 
study. If, therefore, a significant difference is detected, demon- 
stration that the information used was not wholly accurate is not 
sufficient to dismiss it as meaningless. It must be shown that the 
inaccuracy was of sufficient magnitude and affected the patient 
series and comparison group differentially in such a way as could 
reasonably be expected to produce the observed difference. In 
data in which no difference has been observed between patient 
series and comparison group, the effect of inaccuracy of informa- 
tion may be more difficult to assess, since randomly occurring 
errors and omissions will have the effect of reducing real differ- 
ences between the two groups and of making the detection of 
such differences more difficult. In fact, accuracy of information 
may be more crucial in supporting a negative result than a posi-
tive one. In most situations the major issue with respect to ac-
curacy of information is that of obtaining the same level of ac-
curacy in all groups compared.

Naturally, within the limits of the source of information used, 
efforts should be made to obtain the highest practical degree of 
accuracy. Each of the sources listed above can yield information 
of value, but the quality of information will vary from source to 
source according to the item sought. 

Hospital and vital records may be considered together. As 
noted, neither is a source of the specific information commonly 
sought in case history studies investigating explicit hypotheses. 
However, this generalization is subject to modification in the light 
of local conditions in which a variety of items of information may 
have been included in routine records for research or other pur- 
poses. In many areas maternity hospital records have achieved a 
degree of uniformity in a variety of data that allow examina- 
tion of numerous variables and hypotheses. 

Data routinely recorded in hospital or vital records may have 
advantages over data on the same variables obtained from other
sources, particularly if such data have been recorded prior to 
onset of the disease under investigation and therefore cannot 
have been affected by knowledge of the disease. For example, in 
a study of the birth weights of children subsequently identified 
as being mentally defective, birth certificates or hospital records 
would be superior to the mothers’ memories as sources of infor- 
mation on birth weight not only because of the ordinary deficien- 
cies of human memory, but also because these records have been 
made prior to the identification of the child as mentally defec- 
tive. 

Many events, for example, maternal infection during preg- 
nancy, if they occur, may be recorded irregularly in hospital 
records. Some positive histories for such events will therefore 
be found, but one may be sure that all the positive cases have not 
been identified. This is an example par excellence of a situation 
in which a similar degree of inaccuracy must be sought for both 
patients and comparison group. Greater reliance may be placed 
on comparisons of frequency of recording of such events if they 
are recorded without knowledge of the existence of defect — for 
example, data on maternal infections during pregnancy would be 
restricted to antenatal records made prior to time of diagnosis of 
the malformation or other defect under study. Determinations 
of the presence or absence of a causal factor made after diagnosis 
of the disease being investigated should be suspect, unless the fac- 
tor is of a highly objective nature. 

There is no more prolific source of data relative to a person’s 
past experience than his or her own memory. The memories
of his close relatives may be almost as abundant a source of in- 
formation and may be available in situations where that of the 
patient is not, for example, in studies of young children or of 
persons who have died. The most obvious uncertainty in the use 
of these sources is that although the human memory may record 
an event when it occurs, the record is by no means indelible, be- 
ing liable not only to erasure (inability of recollection) but also 
to distortion. Moreover the probability of erasure and distortion 
is related to subsequent events affecting the person as well as to 
the time elapsed since the event. If a person’s memory of trauma 
to a limb is stimulated by the knowledge that he now has a bone
tumor in that limb, or if a mother’s memory of events that oc-
curred during her pregnancy or during her child’s early life is
stimulated by the fact that her child now has a serious disease, it 
will be extremely difficult to find suitable comparison individuals 
in whom a similar degree of stimulation could be expected. 
Certainly, healthy individuals will not be suitable. 

A procedure sometimes employed is to question patients as to 
the presence or absence of a certain factor and then to check by 
reference to hospital records or other objective sources those 
cases in which the factor was stated by the patient to have been 
present. The sense of security conferred by this procedure is 
illusory, since in data obtained from patients’ memories, errors 
of omission probably are more frequent and should cause greater 
concern than errors of commission. A patient will seldom, even in 
error, report the occurrence of a nonexistent event, but will fre-
quently forget to report an event that actually occurred. This is
illustrated in Table 37, which shows the results of a mail ques-
tionnaire to patients whose hospital records had been examined
for evidence of their having been X-rayed during a specified
pregnancy. Among patients whose hospital records did not in-
dicate that an X-ray had been taken, only 2 patients said that
they had had an X-ray. But, among patients who were known
to have had X-rays, more than 25 per cent stated that they had
not. Checking of the hospital records of the patients who said
they had been X-rayed would not in this case have uncovered the
major source of error. If patients’ statements are to be checked
by reference to objective sources of information, both positive and
negative statements should be checked whenever possible.


TABLE 37


Hospital record and patients’ statements as to the presence or absence
of prenatal abdominal X-ray


                              Patients’ statement
Hospital record
statement          X-rayed   Not X-rayed   Don’t know   Total
X-rayed               24          10            3         37
Not X-rayed            2          31            5         38
Total                 26          41            8         75

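
The two kinds of error in Table 37 can be set side by side with a little arithmetic. The Python sketch below is an illustration only and is not part of the original study; it assumes that the hospital record is the correct standard, and the names `table_37` and `share` are hypothetical.

```python
# Counts from Table 37: outer key is the hospital record,
# inner key is the patient's own statement.
table_37 = {
    "X-rayed":     {"X-rayed": 24, "Not X-rayed": 10, "Don't know": 3},
    "Not X-rayed": {"X-rayed": 2,  "Not X-rayed": 31, "Don't know": 5},
}


def share(record: str, statement: str) -> float:
    """Proportion of patients with a given record who gave a given statement."""
    row = table_37[record]
    return row[statement] / sum(row.values())


# Error of omission: the record shows an X-ray, the patient denies it.
omission = share("X-rayed", "Not X-rayed")
# Error of commission: the record shows no X-ray, the patient reports one.
commission = share("Not X-rayed", "X-rayed")

print(f"errors of omission:   {omission:.1%}")
print(f"errors of commission: {commission:.1%}")
```

Under that assumption, omissions (10 of 37) are several times as frequent as commissions (2 of 38), which is why checking only the positive statements would miss most of the error.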
Vagary of memory is only one item among a complex of varia-
bles, including bias by the interviewer, the circumstances of the
interview, and so on, which may produce differences between pa-
tients and comparison group in accuracy of response. The com-
ponents of this complex cannot always be identified separately.
Their effect clearly varies according to the type of information 
sought, and it is not possible to make generalizations, other than 
advising the use of common-sense methods in equalizing such 
bias among the groups being studied. Any technique that can be 
used to measure the influence of this complex will more than re- 
pay the additional labor it may involve. 

For example, in a study of 1465 patients with lung cancer, 
Doll and Hill (1952) noted that the lung cancer patients differed 
from their matched comparison group (1) by giving a higher 
percentage of histories of smoking, and in particular heavy 
smoking, and (2) by reporting a higher frequency of pneumonia 
and chronic bronchitis in the past. During the course of the same 
study 335 patients with chest disease were notified to the investi- 
gators and interviewed in the belief that they had lung cancer. 
These latter patients were subsequently found not to have lung 
cancer and were excluded from the lung cancer series. Neverthe- 
less their histories were compared with those of the true lung 
cancer patients and with the comparison group (patients with 
diseases other than cancer). It was found that, with respect to 
smoking habits, the incorrectly notified group resembled the 
comparison group and not the lung cancer series, but with re- 
spect to history of pneumonia and chronic bronchitis, they re- 
sembled the lung cancer series rather than the comparison group. 
These comparisons suggested (1) that the difference between true 
lung cancer cases and comparison group in smoking habits was 
not the result of the patient or the interviewer knowing that the 
patient had lung cancer, and (2) that there is a tendency for pa- 
tients with diseases of the chest of all forms to report more 
pneumonia and chronic bronchitis than do patients with other 
diseases; this, rather than an association of pneumonia and 
chronic bronchitis with lung cancer, may be the explanation of 
the higher frequency of pneumonia and chronic bronchitis ob- 
served in the main study. 

Other procedures that are appropriate in this connection in-
clude the comparison of data collected by different interview-
ers, the comparison of histories given by different types of re-
spondents in situations where the source of information is not
the same for all patients, and the comparison of data on the same 
item that might be obtained from more than one source. For ex- 
ample, in a small number of cases Stocks and Campbell (1955) 
obtained histories of smoking habits of lung cancer patients both 
from the patients themselves and from their spouses. Appreciable 
differences were found but these were not sufficiently large to 
affect the major conclusions of the study. Where practical, provi- 
sion in a study design for re-interview of a sample of inter- 
viewees some time after the first interview is a valuable adjunct. 


Nonresponse 


Whatever the source of information used, there will usually be 
some patients for whom the required data cannot be obtained, 
because the patient could not be contacted, because he or she re- 
fused the information, or because the person obtaining the in- 
formation did not ask the question or did not record the answer. 
This deficiency of information goes under the general name of 
nonresponse. It is important for two reasons: (1) in gross 
amount it reduces the volume of available data, and (2) all 
groups are not affected equally, the nonrespondents nearly always 
differing in important characteristics from those for whom the 
data were obtained. As in the general question of inaccuracy of 
data, there are two practical considerations: (a) reduction of
nonresponse to its lowest practicable level, and (b) determina- 
tion that the level and type of nonresponse are similar in the 
study group and the comparison group. The first consideration is 
perhaps the prior one, since the most satisfactory method of deal- 
ing with the second consideration is to eliminate nonresponse in 
both groups. Even if nonresponse cannot be eliminated entirely, 
the lower its level, the less its possible effect. However, it is most 
important that efforts to achieve this minimum be applied 
equally to both the study and the comparison groups. 

For example, Record and McKeown (1949) obtained data by in- 
terviews with mothers of children with congenital malformations 
of the central nervous system and with mothers of a random 
sample of births. Of the mothers of malformed children 81.2 per 
cent, and of the sample mothers 84.7 per cent, were traced and
successfully interviewed. For all but 2 per cent of the malforma- 
tion series data on certain items (e.g., parity) could have been 
obtained from another source (fetal death certificates), which did 
not supply data for the sample mothers. In an examination of 
parity it would not have been legitimate to compare the 98 per 
cent of the malformation series for whom parity was known with 
the 84.7 per cent of sample mothers who were interviewed, since 
the untraced malformation cases included a higher proportion of 
first births than did the traced cases. This would be expected 
from the known greater mobility of small families; but, even if 
such a relationship had not been suspected, the comparison would 
have been contraindicated. The correct comparison in such a sit- 
uation is necessarily restricted to the interviewed mothers in both 
groups. 

Some of the variables that can be expected to differ between 
nonrespondents and respondents in interview situations include 
the time lapse between identification of the study patients and 
the attempted interview, age, marital status, size of family, oc- 
cupation, economic status, and ethnic group. Even nonresponse of 
a patient due to omissions in hospital records may not occur ran- 
domly, being possibly related, for example, to private or ward 
status, service to which admitted, number of previous admissions, 
duration of stay, and so on. 


ANALYSIS 


The basis of the analysis of a case history study is a comparison 
between study group and comparison group with respect to the 
frequency of factors whose possible etiologic influence is being 
tested. This may be a comparison in a fourfold table, such as that 
in Table 38, in which the concern is merely with the presence or 
absence of the suspect factor, or it may be in the form of two fre- 
quency distributions, as shown in Table 39, where the intensity 
or duration of exposure is also considered. 

TABLE 38


Distribution of 1299 children dying of cancer and of a matched com-
parison group according to the presence or absence of prenatal
X-ray exposure*


Prenatal          Cancer    Comparison
irradiation       deaths    group
Reported            178         93
Not reported       1121       1206
Total              1299       1299


* From Stewart et al. (1958).
χ² = 29.8, n = 1, p < 0.001


TABLE 39


Distribution of 1465 lung cancer patients and a matched comparison
group according to average amount of tobacco smoked daily over
the 10 years preceding onset of the present illness*


                         Males                       Females
Daily average   Lung cancer   Comparison   Lung cancer   Comparison
cigarettes      patients      group        patients      group
0                     7           61            40            59
1-4                  55          129            16            25
5-14                489          570            24            18
15-24       [illegible]          431            14             6
25-49               295          154            14             0
50+                  38           12             0             0
Total              1357         1357           108           108


* From Doll and Hill (1952).
For men: χ² = 93.77, n = 4, p < 0.001
For women: χ² = 17.41, n = 3, p < 0.001


In Table 38 it may be seen that a higher proportion of the
children with cancer than of the comparison group had a history
of prenatal (intrauterine) X-ray exposure, suggesting that there may
have been an association between X-ray exposure in utero and
death from cancer in childhood. Table 39 shows a considerable
difference between lung cancer patients and a comparison group in
the number of nonsmokers (and hence in the number of smokers);
in addition, there is a difference within the group of smokers in
the distribution according to number of cigarettes smoked per day.
In both these tables, therefore, an appreciable difference exists
between the study group and the comparison group. In the fur-
ther analysis of these differences, at least three distinct questions
must be asked:

(1) Is the difference statistically significant?

(2) If the difference is statistically significant, can it be ex- 
plained as a secondary association resulting from other known as- 
sociations of the disease? 

(3) What is the risk estimated to be associated with the 
presence of the suspected factor? 


Statistical Significance 


The first of these questions is, of course, the prior one. Unless 
first shown to be statistically significant, few differences will war- 
rant the consideration outlined below. However, it is possible 
that even nonsignificant differences, if they satisfy some of the 
criteria indicated below, may raise sufficient interest at least to 
stimulate the collection of additional data. It should be remem- 
bered that an apparent difference may fail to be significant statis- 
tically either because no real difference exists or because the data 
are numerically insufficient. 

It is not our purpose to present material that would be more 
appropriately found in a textbook of biostatistics, although it 
must be admitted that in several areas, including the analysis of 
case history studies, the border between epidemiology and bio-
statistics is ill-defined. Perhaps the commonest statistical tech-
nique involved in case history studies is the chi-squared test for
the comparison of distributions. For descriptions of this and other 
appropriate procedures the reader is referred to such texts of 
biostatistics as those of Hill (1948), Bancroft (1957) or Croxton 
(1953). A recent article by Mantel and Haenszel (1959) out- 
lines the specific application of the chi-squared test to case history 
studies. 
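
As a rough illustration of the chi-squared test applied to a fourfold table, the Python sketch below uses the counts of prenatal X-ray exposure among the 1299 cancer deaths and the 1299 comparison children (Table 38). It is a sketch under stated assumptions rather than a substitute for the texts cited: the function names are hypothetical, and the choice of the uncorrected formula (no continuity correction) is an assumption, made because it reproduces the value of 29.8 quoted with the table.

```python
import math


def chi_squared_fourfold(a: int, b: int, c: int, d: int) -> float:
    """Uncorrected chi-squared for a 2 x 2 table with cells
    a, b in the first row and c, d in the second row."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


def p_value_one_df(chi2: float) -> float:
    """Upper-tail probability of chi-squared with 1 degree of freedom.
    With 1 d.f., chi-squared is the square of a standard normal deviate,
    so P(X > chi2) = erfc(sqrt(chi2 / 2))."""
    return math.erfc(math.sqrt(chi2 / 2))


# Rows: irradiation reported / not reported; columns: cancer deaths / comparison.
chi2 = chi_squared_fourfold(178, 93, 1121, 1206)
p = p_value_one_df(chi2)

print(f"chi-squared = {chi2:.1f}, p = {p:.2g}")
```

The same function can be applied to any fourfold table; for tables with more than two rows, such as a distribution by amount smoked, a general chi-squared routine with the appropriate degrees of freedom would be needed.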


Secondary Associations 


The second question to be raised is whether the difference be- 
tween study group and comparison group is secondary to a dif- 
ference between the two groups in some important characteristic, 
particularly some major demographic characteristic such as age, 
sex or race, whose influence on the disease is already recognized. 
It has already been noted in this chapter that to a certain extent 
such differences may be reduced in the study design by match- 
ing the comparison group to the cases with respect to variables 
that are thought likely to complicate consideration of the vari-
ables under examination. It will, however, be unusual for all such
variables to have been matched, and almost inevitably some an- 
alytic procedures will be necessary to examine the effect of 
variables that were not matched. 

There are two problems: (1) the identification of the variables 
that are likely to produce secondary associations between the dis- 
ease and the factors under investigation; and (2) examination 
of the data to determine whether in fact such variables are re- 
sponsible for the association observed. 

The identification of such variables depends in the first place 
upon knowledge of what is known from previous studies about 
the natural history of the disease in question and about the distri- 
bution of the suspected factor in the general population. Any 
variable that is known to be strongly associated with either the 
disease or the suspected factor must be considered as a possible 
cause of association between the disease and the suspected factor. 
In addition, the data of the study itself must be examined for 
evidence of associations that were not previously recognized or 
that are peculiar to the data of this particular study. This exami- 
nation should involve two procedures: 

(1) Comparison of study group and comparison group with 
respect to the major demographic variables such as age, sex and 
race, and such other variables for which data have been collected. 
Even variables that have been matched in the selection of the 
comparison group may require comparison if the matching cri-
teria have not been absolute. For example, if the comparison
group members were selected by virtue of being within 5 years of 
age of the corresponding patients, it is conceivable that there may 
be a consistent bias toward the selection of a comparison group 
whose members may on average be up to 5 years older than the 
patients. Where differences are found, the possibility that these 
reflect conditions that might influence the frequency of the sus- 
pected factor must be considered. For example, in the Doll and 
Hill study shown in Table 39, patients and comparison group 
members were matched for age and sex. In addition, they were 
compared with respect to social status, occupation, residence near 
a gasworks and type of house heating. The fact that the two 
groups were remarkably similar in all these respects makes the 
marked difference in smoking habits more striking, and makes
less tenable a hypothesis of the association of lung cancer and 
smoking being due to a third variable. On the other hand, in the 
same study differences were noted between patients and controls 
with respect to frequency of previous respiratory illness, and it 
was thus necessary to examine the possible effect of this difference 
on the original hypothesis. In some studies, certain factors are in- 
cluded not for their possible relevance to the etiology of the dis- 
ease under study, but to enable factual statements to be made 
about areas of general similarity shared by the study group and 
the comparison group. 

(2) Study of the patterns of distribution of the suspected 
causal factors separately within subgroups of both the study and 
the comparison groups. This procedure will produce evidence as 
to which variables are most likely to influence the frequency of 
the suspected factor, and should therefore receive most attention 
in comparisons of patients and comparison group. 

When attention has been drawn to a particular variable being 
possibly productive of secondary associations within the data, it 
remains to be seen whether the variable does in fact explain the 
difference between the compared groups that is of primary inter- 
est. The statistical procedures involved here may be highly com- 
plicated, but the basic examination is a comparison of the study 
and comparison groups with respect to frequency of the suspected 
factor within limited ranges of the complicating variable. 

For example, Lowe (1956) compared the smoking habits of 
patients with tuberculosis with those of minor accident patients 
and general surgical patients, unmatched with respect to age or 
sex. Since smoking is markedly associated with both these vari- 
ables, it was necessary to determine whether a difference noted 
between the smoking habits of the patients with tuberculosis and 
those of the patients in the comparison group was due to differ-
ences in age or sex distribution of the groups. The data are shown
in Table 40. For simplicity, only the percentages of smokers of
20 or more cigarettes a day are shown, although, obviously, total
distributions according to amount smoked could also be com-
pared in each age and sex category. The table shows that for each
sex, in each 10-year age category examined, the percentage of
heavy smokers was higher among the tuberculous than among
the general patients. The difference between the tuberculous pa-
tients and the comparison groups in smoking habits cannot,
therefore, be explained by differences in their age or sex composi- 
tion, since the difference in smoking habits remains when age and 
sex are held constant. Another example of this procedure was 
given in Chapter 12 (see Table 31, page 202) in the examination
of the independent effects of birth order and maternal age on the 
incidence of infantile pyloric stenosis. 


TABLE 40


Percentage of smokers of 20 or more cigarettes a day among patients
with tuberculosis and a comparison group, in age and sex groups*


                                               Percentage smoking 20 or
                Number of patients             more cigarettes a day
              Males          Females          Males          Females
Age        Tuber-          Tuber-          Tuber-          Tuber-
(years)    culous  Other   culous  Other   culous  Other   culous  Other
30-39        211    126      130    113     48.3    46.0     9.2     2.7
40-49        164    141       36     94     51.8    44.6    19.4     3.2
50-59        104    103       19     42     51.0    41.8    10.5     0.0
60+           41     49                     51.2    36.8
30+          520    419      185    249     50.1    43.4    11.4     2.4
30+†           —      —        —      —     50.3    43.7    12.4     2.5

(For females, the figures entered at 50-59 refer to all ages 50 and over.)


* Data from Lowe (1956).


† Standardized to the age distribution of the combined cases and comparison
group.


In conducting examinations such as these, the method of group- 
ing is important. The subgroups set up must contain a sufficient 
number of cases for separate examination of each subgroup, but 
they must not be so broad that the desired similarity between com- 
pared series is not obtained. For example, if a disease is markedly 
related to age, the use of 20-year-age groups may not eliminate the 
effect of age, since within 20-year-age groups there may be suffi- 
cient difference between two series to produce appreciable differ- 
ences in disease incidence. It may be difficult to decide whether 
the adopted grouping has been sufficiently fine to effectively 
equalize the compared series. If the apparent difference between 
compared groups is eliminated by holding another variable con-
stant, it may be assumed that the grouping was satisfactory. How-
ever, if the apparent influence of the variable is reduced but not
eliminated, the effect of finer grouping should be ascertained. If 
finer grouping is impractical because of inadequate numbers, it 
may be possible to estimate by indirect methods the maximum 
possible influence of variation within the chosen groups by mak- 
ing assumptions about the maximum extent of variation possible 
within the adopted grouping and the maximum change in inci- 
dence that might be associated with it. Some general idea about 
the possible effect of finer grouping may also be obtained by not- 
ing the change in the crude data that results from examination 
of relatively broad groups, as discussed in the next paragraph. 

In addition to visual inspection of the differences between the 
study and comparison groups in each subdivision of the data, it is
usually desirable to test the statistical significance of the differ- 
ences and to obtain some sort of summary statement that will ex- 
press the sum of the differences noted in each subdivision — in 
other words, the total difference between the study and compari- 
son groups that exists independently of any difference attribut- 
able to the complicating variables. The significance test that will 
be most often appropriate is the cumulative chi-squared test, as 
described in relation to this problem by Mantel and Haenszel 
(1959). A simple statistical technique that may be used to obtain 
a summary statement of the difference between groups after the 
effect of a complicating variable has been eliminated
is one or other of the standardization techniques that are com- 
monly used in adjustment for age differences between compared 
tabulations. Standardization of the data in Table 40 to the age 
distribution of the combined cases and comparison group results 
in the percentages of smokers of 20 or more cigarettes a day 
shown in the bottom row of the table. These figures are little dif- 
ferent from the crude totals before standardization, which sug- 
gests that an even finer subdivision of age groups than the 10 year 
periods used would not appreciably affect the results. 
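
The standardization described above can be reproduced for the male columns of Table 40. The Python sketch below is a minimal illustration, assuming direct standardization with the combined numbers of tuberculous and other patients in each age group as weights; the function and variable names are hypothetical, and because the percentages are taken as printed (to one decimal place) the results agree with the table only to within rounding.

```python
# Male columns of Table 40:
# (age group, number tuberculous, number other,
#  % smoking 20+ a day among tuberculous, % among other)
males = [
    ("30-39", 211, 126, 48.3, 46.0),
    ("40-49", 164, 141, 51.8, 44.6),
    ("50-59", 104, 103, 51.0, 41.8),
    ("60+",    41,  49, 51.2, 36.8),
]


def crude(rows, n_col, pct_col):
    """Overall percentage, weighting each age group by the group's own numbers."""
    total = sum(row[n_col] for row in rows)
    return sum(row[n_col] * row[pct_col] for row in rows) / total


def standardized(rows, pct_col):
    """Overall percentage, weighting each age group by the combined
    numbers of tuberculous and other patients in that age group."""
    weights = [row[1] + row[2] for row in rows]
    return sum(w * row[pct_col] for w, row in zip(weights, rows)) / sum(weights)


crude_tuberculous = crude(males, 1, 3)
crude_other = crude(males, 2, 4)
std_tuberculous = standardized(males, 3)
std_other = standardized(males, 4)

print(f"crude:        {crude_tuberculous:.1f} vs {crude_other:.1f}")
print(f"standardized: {std_tuberculous:.1f} vs {std_other:.1f}")
```

Because both series are given the same set of age weights, any difference that remains between the standardized percentages cannot be due to the age groups used.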


Estimated Risk 


The third question in the consideration of a difference noted 
between the study and comparison groups concerns the size of the 
difference — how great a risk is apparently conferred by exposure 
to the factor for which the difference is noted. In the previous 
chapter, it was noted that such a risk may be expressed in two
ways: 

(1) Relative risk—the risk (rate) in exposed persons di- 
vided by the risk in the nonexposed group. 

(2) Attributable risk — the risk in exposed persons minus 
the risk in the nonexposed group. 

In estimating these risks in case history studies it is necessary to 
distinguish clearly between studies in which the cases can be re- 
lated to a defined population and those in which the cases have 
been assembled without reference to a defined population. 

Studies of cases referable to a defined population. When the 
cases represent all cases of the disorder in a population or are a 
sample of such cases selected in a known way and the comparison 
group is representative of the same population, it will be possible 
to estimate separate rates of occurrence of the disease in exposed 
and nonexposed persons and to derive relative and attributable 
risks from these estimates. 

For example, the cases of childhood cancer shown in Table 38 
(page 254) were 77 per cent (all those for whom data could be ob- 
tained) of the 1694 children under 10 years of age who died of can- 
cer in England and Wales in the years 1953 to 1955. The compari- 
son children were selected from birth certificates in the same 
locality in which the deaths of the cancer patients occurred and 
were matched for age and sex. The assumption may be made that, 
with respect to frequency of prenatal X-ray irradiation, the com- 
parison group was representative of the total number of births that 
resulted in the population of children aged less than 10 years in 
the period 1953 to 1955. Since 7.2 per cent of the comparison 
sample population received prenatal X-rays, it is assumed that a 
similar proportion of all children alive in the population at risk 
received prenatal X-rays. The average population of England and 
Wales aged less than 10 years in 1954 was approximately 6,999,000. 
The population at risk may therefore be supposed to consist
of approximately $6{,}999{,}000 \times \frac{7.2}{100} = 503{,}900$ children who were
X-rayed prenatally and $6{,}999{,}000 \times \frac{92.8}{100} = 6{,}495{,}100$ who were
not. By applying to these populations the numbers of cancer cases
found to have been X-rayed (178) and those not X-rayed (1121),
one can derive death rates in the X-rayed and the non-X-rayed
population during the 3 year period. Division by three yields aver-
age annual death rates of $\frac{178}{503{,}900 \times 3} \times 10^{6} = 118$ per million in
the X-rayed, and $\frac{1121}{6{,}495{,}100 \times 3} \times 10^{6} = 58$ per million in the
non-X-rayed children. If it is assumed that the 395 untraced cancer
cases were distributed by X-ray status in the same way as the traced 
cases, these rates become 154 and 75 per million, respectively. 
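
The arithmetic of this example may be easier to follow when set out step by step. The Python sketch below repeats the calculation using only figures given in the text; the variable names are hypothetical, and the final two lines, which form the relative and attributable risks from the adjusted rates, are an added illustration of the two definitions rather than figures quoted by the original authors.

```python
population_under_10 = 6_999_000   # England and Wales, 1954 (approximate)
share_xrayed = 0.072              # 93 of 1299 comparison children, as rounded in the text
years = 3                         # deaths occurred in 1953 to 1955

pop_xrayed = population_under_10 * share_xrayed
pop_not_xrayed = population_under_10 * (1 - share_xrayed)


def annual_rate_per_million(deaths, population, years):
    """Average annual death rate per million persons."""
    return deaths / (population * years) * 1_000_000


rate_xrayed = annual_rate_per_million(178, pop_xrayed, years)
rate_not_xrayed = annual_rate_per_million(1121, pop_not_xrayed, years)

# Allow for the 395 untraced cases, assuming they were distributed
# by X-ray status in the same way as the 1299 traced cases.
scale = 1694 / 1299
adj_xrayed = rate_xrayed * scale
adj_not_xrayed = rate_not_xrayed * scale

relative_risk = adj_xrayed / adj_not_xrayed        # ratio of the two rates
attributable_risk = adj_xrayed - adj_not_xrayed    # difference, per million per year

print(round(rate_xrayed), round(rate_not_xrayed))
print(round(adj_xrayed), round(adj_not_xrayed))
print(round(relative_risk, 2), round(attributable_risk))
```

Note that the scaling for untraced cases changes both rates, and hence the attributable risk, but leaves the relative risk unaltered.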

Studies of cases not referable to a defined population. The data 
in Table 39 (see page 254) cannot be related to a population. The
lung cancer patients were patients attending selected hospitals and 
no attempt was made to limit the group according to place of resi- 
dence or to determine what proportion these represented of the 
total lung cancer patients in the areas served by the hospitals. 
Similarly, the comparison group was selected from patients with 
other diseases in the same hospitals, and although it was therefore 
a sample of the same population from which the lung cancer pa- 
tients were drawn, it is not known what proportion of that popu- 
lation they represented. 

Since neither the population at risk nor the numerator of the 
rate is known, it is impossible to derive separate estimates of in-
cidence in the exposed and the nonexposed populations; it is
therefore impossible to derive the risk attributable to exposure. 
However, it is possible to derive a relative risk for the following 
reasons. It was noted in Chapter 4 that a theoretical population 
can be divided among the cells of a fourfold table as follows, 
where a, b, c and d represent the frequencies of individuals in the 
various cells. 


                      Disease
Suspected
cause         Present    Absent    Total
Present          a          b       a+b
Absent           c          d       c+d
Total           a+c        b+d       1


In a case history study, only derivatives of such frequencies are
available, and the numbers of individuals in each cell may be de-
noted as a’, b’, c’, and d’.


Historical                Comparison
episode        Cases      group
Present          a’          b’
Absent           c’          d’
Total          a’+c’       b’+d’


The right-hand column has been omitted since, as noted, it can- 
not be derived in a case history study unless it is known what pro- 
portion of (a+ c) and (b + d) are represented by (a’ + c’) and 
(b’ + d’) —in other words in what proportion to weight the cases 
and comparison group columns to derive an estimate of total pop- 
ulation. 

From the first diagram showing the theoretical distribution of 
the population, one can see that the risk in exposed persons rela- 
tive to the risk in those not exposed would be: 


\[
\frac{\text{incidence of the disease in exposed persons}}
     {\text{incidence of the disease in nonexposed persons}}
= \frac{a/(a+b)}{c/(c+d)} = \frac{a(c+d)}{c(a+b)}
\]


When the number of persons affected by the disease under in- 
vestigation is small compared to the number unaffected (the 
usual situation in disease studies), d is approximately equal to 
c + d, and b is approximately equal to a + b. The formula for
relative risk then reduces to

\[ \frac{a(c+d)}{c(a+b)} \approx \frac{ad}{bc}. \]


When the cases studied represent only a sample of all cases in the
population, and the comparison group is a sample of the population
at risk, then a’ = as₁, c’ = cs₁, b’ = bs₂ and d’ = ds₂, where
s₁ is the sampling fraction used in selecting the cases and s₂ is the
sampling fraction used in selecting the comparison sample from
the population. The formula becomes:

\[
\text{Relative risk} = \frac{ad}{bc}
= \frac{(a'/s_1)(d'/s_2)}{(b'/s_2)(c'/s_1)} = \frac{a'd'}{b'c'}
\]

It is clear therefore that for the derivation of relative risk it is not
necessary to know the values s₁ and s₂.
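
The cancellation of the sampling fractions can be checked numerically. The following Python fragment is a minimal sketch under stated assumptions: the population counts and the sampling fractions s1 and s2 are hypothetical values chosen for illustration, not data from any study cited here, and the function name is likewise illustrative.

```python
from fractions import Fraction


def cross_product_ratio(a, b, c, d):
    """Return ad/bc for a fourfold table (a, b exposed; c, d nonexposed)."""
    return Fraction(a * d, b * c)


# Hypothetical population frequencies (illustrative only).
a, b = 200, 19_800   # exposed: diseased, not diseased
c, d = 100, 79_900   # nonexposed: diseased, not diseased

# Assumed sampling fractions: half of all cases, 1 in 100 of the unaffected.
s1, s2 = Fraction(1, 2), Fraction(1, 100)

# Frequencies that a case history study would actually observe.
a1, c1 = a * s1, c * s1   # cases
b1, d1 = b * s2, d * s2   # comparison group

population_ratio = cross_product_ratio(a, b, c, d)
sample_ratio = (a1 * d1) / (b1 * c1)
true_relative_risk = Fraction(a, a + b) / Fraction(c, c + d)

print(float(population_ratio), float(sample_ratio), float(true_relative_risk))
```

In this hypothetical population the cross-product ratio is about 8.07 whether it is computed from the whole population or from the sampled frequencies, against a true relative risk of 8.0; the small gap reflects the rare-disease approximation, not the sampling.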

For example, from the data in Table 38 (see page 254) one can
calculate that the risk of death from cancer for children reporting
prenatal irradiation was (178 × 1206)/(93 × 1191) = 2.0 times as
great as for those not reporting such irradiation. This is consistent
with the estimates calculated earlier in the chapter of death rates of
154 and 75 per million in exposed and nonexposed children,
respectively. From the data in Table 39, in which the values s₁ and s₂
are not known, the relative risk for smokers (any amount) compared to
nonsmokers would be (1350 × 61)/(7 × 1296) = 9.1.

The use of the above formula involves two assumptions: (1) that
the disease under study is infrequent; and (2) that neither the
patient series nor the comparison group is selected in favor of
either exposed or nonexposed individuals; that is, that the fractions
s₁ and s₂ apply equally to exposed and nonexposed individuals
in the cases and the comparison group, respectively. If the
first of these assumptions is open to question, the formula still has
use since the estimate derived is a function of relative risk.

If the design of the study is inconsistent with the second
assumption, however, a derived relative risk is necessarily misleading.
For example, the relative risk of lung cancer in relation to
smoking has been found to be generally smaller in studies in
which hospital patients formed the comparison sample than in cohort
studies in which the comparison group derived from the general
population. Menaker (1956) has pointed out that the reported
proportion of nonsmokers among hospital patients is
smaller than among the population at large, presumably because
a number of diseases other than lung cancer are positively associated
with smoking. Thus, in case history studies in which hospital
patients are used as a comparison group, the proportion
sampled from the general population may be higher for those exposed
(smokers) than for those not exposed (nonsmokers). In
this case the error is in favor of understatement. The relationship
to smoking is so much stronger for lung cancer than for other
diseases that the bias did not obscure the relationship. However,
under other circumstances such a bias could readily operate in the
other direction.

Adjusted relative risk. We have considered a relative risk derived
from a simple fourfold table containing a summary of the
study data. This must be considered a crude relative risk, since no
allowance has been made for the fact that some or all of the apparent
association noted may result from indirect associations with
variables such as age, whose influence has not been considered in
the total data. As noted previously, the possible effect of such
variables must be examined by classification of the data to
allow comparison of cases and comparison group within defined
limits of the complicating variable. Within each subclassification
a relative risk can then be calculated by the formula a’d’/b’c’.
For example, in Lowe’s data for males shown in Table 40 (see
page 258), the relative risks for smokers of 20 or more cigarettes a
day are 1.10, 1.33, 1.45 and 1.80, respectively, in the four age
groups shown. The problem then is to derive an over-all relative
risk which is a summary of the relative risks estimated within each
of the subclassifications. One procedure is to apply the formula to
the total distribution after standardization. For example, using the
standardized data for males in the table, the relative risk is
(50.3 × 56.3)/(43.7 × 49.7) = 1.30.* In this particular case, the
effect of standardization is small, the relative risk being practically
identical with that which might be calculated from the total crude
data for males, 1.31. The estimated summary relative risk may vary
somewhat according to the population used for standardization. Other
methods of deriving summary relative risks have been described
by Mantel and Haenszel (1959).

* Note that the formula can be applied to percentages as well as to
actual numbers.

Multiple grouping of study variables. We have so far considered
only situations in which the classification of the variables
under study is dichotomous, such as in Table 38 and Table 40.
Clearly, the setting up of an artificial dichotomy at a certain level
of exposure such as 20 cigarettes a day, as in Table 40, underestimates
the relative risk that is associated with the factor considered.
A more realistic estimate of the importance of the factor is obtained
by comparing those experiencing minimal exposure with
those maximally exposed, for example, in Table 39, by comparing
nonsmokers with those smoking 50 or more cigarettes a day.
The formula a’d’/b’c’ can be applied separately to any exposure
level; it is not necessary that c’ and d’ be the total number of cases
minus a’ and b’. For example, for the data for males in Table 39,
the relative risk of lung cancer in smokers of 1 to 4 cigarettes
compared to nonsmokers is (55 × 61)/(7 × 129) = 3.7. Likewise, the
relative risk to smokers of 5 to 14 cigarettes compared to smokers of
1 to 4 is (489 × 129)/(55 × 570) = 2.0. A series of relative risks can
therefore be derived for adjacent categories of smoking. These are
shown in the column labeled (a) of Table 41. In this particular study
it adds greatly to the strength of the hypothesis that the relative
risk is greater than 1 in each comparison of a category of smoking
with the lower adjacent category.


TABLE 41

Relative risks as derived from the data for males in Table 39

                                                Relative risk
Daily average    Lung cancer    Comparison
(cigarettes)     patients       group           (a)*      (b)†
0                     7             61           -         1.0
1-4                  55            129           3.7       3.7
5-14                489            570           2.0       7.4
15-24               475            431           1.3       9.6
25-49               293            154           1.7      16.3
50+                  38             12           1.7      27.7

* Relative to persons in the adjacent lower category of smoking.
† Relative to nonsmokers.


Since smokers of 1 to 4 cigarettes have 3.7 times the risk of
nonsmokers, and smokers of 5 to 14 cigarettes have 2.0 times the risk
of smokers of 1 to 4, it follows that smokers in the 5 to 14 category
have 2.0 × 3.7 = 7.4 times the risk of nonsmokers. Therefore, by
cumulative multiplication, a series of risks in each exposure
group can be calculated relative to a risk of 1.0 in the lowest
group. This series is shown in the table in the column labeled (b).
The risk of 27.7 for the highest exposure group (50 or more
cigarettes) relative to the lowest (nonsmokers) could also have been
derived by direct calculation for the two groups (38 × 61)/(7 × 12).
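
The cumulative multiplication can be traced step by step. The Python fragment below is a sketch only; the function and variable names are illustrative. The counts are those of Table 41, with the 25-49 row taken as 293 patients, the figure consistent with the 1350 smoking patients quoted in the text. Rounding to one decimal at each step is an assumption made here because it reproduces the tabulated values.

```python
# (category, lung cancer patients, comparison group), males, Table 41.
rows = [
    ("0", 7, 61),
    ("1-4", 55, 129),
    ("5-14", 489, 570),
    ("15-24", 475, 431),
    ("25-49", 293, 154),
    ("50+", 38, 12),
]


def relative_risk(cases_hi, comp_hi, cases_lo, comp_lo):
    """Cross-product ratio a'd'/b'c' for a higher vs. a lower category."""
    return (cases_hi * comp_lo) / (comp_hi * cases_lo)


adjacent = []        # column (a): relative to the adjacent lower category
cumulative = [1.0]   # column (b): relative to nonsmokers
for (_, cases_lo, comp_lo), (_, cases_hi, comp_hi) in zip(rows, rows[1:]):
    rr = round(relative_risk(cases_hi, comp_hi, cases_lo, comp_lo), 1)
    adjacent.append(rr)
    # Multiply the running product by each adjacent-category risk in turn.
    cumulative.append(round(cumulative[-1] * rr, 1))

# Direct comparison of the highest category with nonsmokers.
direct_top_vs_bottom = relative_risk(38, 12, 7, 61)

for (label, _, _), risk in zip(rows, cumulative):
    print(f"{label:>6}  {risk:5.1f}")
```

The direct calculation gives about 27.6, against 27.7 from the chain of rounded adjacent risks; the difference is due to rounding alone.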

In further illustration of the effect of using a dichotomy, as, for 
example, at 20 or more cigarettes a day in the data for respiratory 
tuberculosis in Table 40, we may note that, compared with a rela- 
tive risk of 1.3 derived by comparing smokers of fewer than and 
more than 20 cigarettes a day, in the same study the relative risk
in the highest exposure group (40 or more cigarettes a day) compared
to the lowest (nonsmokers) was 5.3.


INTERPRETATION 


At the termination of a particular case history study, it will be 
possible to determine whether or not in the data of the study in 
question a significant association exists between the variables per- 
tinent to the hypothesis being tested and the disease under inves- 
tigation. The attainment of this objective, however, serves a 
useful purpose only to the extent that the findings of the particu- 
lar study are indicative of the existence of an association between 
the variables and the disease capable of generalization to all situa- 
tions similar to those of the study. Such generalization appears 
most permissible when the study has been designed and con- 
ducted in such a manner that it seems reasonable to assume the 
following: 

(1) The cases studied are indeed representative of cases of
the disease in question.

(2) The comparison group reflects with accuracy the relevant
characteristics of the general population.

(3) The sources of information indicate the past experience
of the cases and comparison group members with a high degree
of precision.

(4) The observed differences between cases and comparison
group have been analyzed with sufficient rigor to indicate their
probable causal as well as statistical significance.

It is not, of course, necessary that these assumptions be accepted 
or rejected in toto. As pointed out previously, data containing er- 
rors or biases need be rejected only if such errors or biases are of 
sufficient magnitude and operate in a direction which could ex- 
plain the observed patterns. Similarly, no series of cases can be 
said to be fully representative of all cases of the same disease, but 
in many situations a specific series will be sufficiently representa- 
tive to allow conclusions in specific areas. Likewise, no compari- 
son group will be truly representative of the general population, 
although in a well-designed study it will not be unrepresentative 
in any relevant respect. 


In addition to assessment of the reliability of the data from a
particular study, a number of more general questions enter into
the interpretation of the findings. These include the way in
which the observations fit into the pattern of existing knowledge
of the disease, the consistency of the observations between subgroups
of the data (e.g., is the relationship observed in both males and
females?), and the extent to which the observations
are confirmed in other studies. A new observation need not
necessarily be explainable through mechanisms consistent with 
existing knowledge, but most new observations that have proved 
useful have not seemed ridiculous in the light of existing knowl- 
edge. Also, existing knowledge helps to put a newly observed 
relationship in perspective. For example, a relationship respon- 
sible for doubling the risk of a disease will have limited signifi- 
cance if much stronger relationships with other variables are al- 
ready known. On the other hand, the minor difference in the 
proportion of males among first births (51.7 per cent) and 
among sixth and later births (51.3 per cent) is of considerable 
interest, because stronger relationships to the sex ratio of human 
births are not known. 

Perhaps the most convincing evidence for the ability to make 
a generalization from an observation comes from replication of 
results by other studies. When varied methodologies result in 
similar conclusions, increasing confidence can be placed in gen- 
eralization from the observations. 

An additional purpose in case history studies will usually be 
to determine the likelihood that any statistical associations noted 
are causal in nature. (For a discussion relative to these inferences 
see Chapter 2.) However, it may be noted here that, despite many
statements to the contrary, the difficulties of inferring cause from
observed association are not peculiar to case history studies; they
are equally evident in cohort studies. The specific difficulties, as
indicated in Chapter 2, are characteristic of the general field of 
observation (as opposed to experiment) and are to be found in 
all observational studies, regardless of method. 


15 
Experimental Epidemiology 


Up to this point we have been dealing with observational 
methods exclusively. As noted in Chapter 2, the most rigorous 
test for causal association is that of experiment — the artificial 
introduction of an attribute or treatment in an attempt to alter 
the course of nature. Such experiments in the field of the natural 
history of human disease range from field trials of vaccines and 
other agents in the prevention of disease to clinical trials of thera- 
peutic procedures that hopefully have a favorable influence on the 
course of disease after it has arisen. Usually, the epidemiologist is 
more concerned with experiments dealing with etiology rather 
than therapy, and with trials involving communities or large pop- 
ulations rather than those restricted to ill persons in hospital beds. 
Our remarks are therefore concentrated in the area of trials of 
preventive measures. 


UNCONTROLLED EXPERIMENT 


It was noted in Chapter 4 that the application of any public
health measure may be considered an experimental test of the
validity of knowledge of disease etiology. Figure 37 demonstrates
the trend of notifications of diphtheria and whooping cough in
Massachusetts during this century. Diphtheria immunization was
introduced with single antigens about 1920, while systematic
whooping cough vaccination started in the late 1940’s with the
introduction of the combined diphtheria-tetanus-pertussis vaccine.
Both diseases show striking decreases in frequency beginning
about the time of the introduction of the respective specific
measures. Patterns of this kind are often accepted as a valid
demonstration of the effectiveness of a public health measure. It is
interesting to note that diphtheria immunization was introduced
without controlled experiments in human populations, while pertussis
immunization was introduced only after rigorous field
trials had demonstrated the effectiveness of the vaccine. The
downward trend in diphtheria incidence is therefore virtually the
only evidence that exists for the effectiveness of diphtheria vaccine,
while the downward trend of whooping cough incidence is
an added confirmation of results obtained by more rigorous
experimental methods.

FIGURE 37. Reported cases of whooping cough and diphtheria in
Massachusetts, 1909-1959. D.T.P.: diphtheria-tetanus-pertussis vaccine.


CONTROLLED EXPERIMENT 


Experimental epidemiology in the modern sense is a cohort 
study in which the difference between cohorts is in respect to 
some artificially induced characteristic. Where only one cohort 
has the induced characteristic, it is common to speak of the co- 
hort with the characteristic as the “experimental group” and the 
one without as the “control group.” A study may have several ex- 
perimental groups, and the control group may be another form 
of experimental group in which the induced characteristic differs 
in kind or amount from that being tested. The essence of a well- 
designed trial is that the various experimental and control groups 
do not differ significantly from each other except with respect to 
the attribute under test. 


Selection of Study Populations 


For a field trial the selection of a population and the determina- 
tion of its necessary size depend on prediction of incidence of dis- 
ease. In a clinical trial prediction of prognosis is required. The 
expected incidence (of disease or of unfavorable outcome) to- 
gether with estimates or guesses as to the differences that might 
be expected between the compared groups is the base for de- 
ciding on the size of the groups that will be required for the 
demonstration of statistically significant differences. Since pre- 
dictions both of incidence and of differences between groups must 
of necessity be crude (otherwise the trial would not be required), 
it is usual to include considerably more persons in the study than 
statistical tests indicate to be the minimum required for confidence
in the significance of predicted differences.
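
The text gives no formula for group size. As one common approximation, offered here as an assumption and not as the procedure the authors had in mind, the number needed per group to compare two proportions can be sketched with the normal approximation. The incidence figures, the significance level, the power and the 50 per cent margin are all hypothetical.

```python
from math import ceil, sqrt
from statistics import NormalDist


def group_size(p_control, p_treated, alpha=0.05, power=0.80):
    """Approximate persons per group needed to detect p_control versus
    p_treated with a two-sided test of two proportions."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    p_bar = (p_control + p_treated) / 2
    numerator = (
        z_alpha * sqrt(2 * p_bar * (1 - p_bar))
        + z_beta * sqrt(p_control * (1 - p_control)
                        + p_treated * (1 - p_treated))
    ) ** 2
    return ceil(numerator / (p_control - p_treated) ** 2)


# Hypothetical guesses: 10 per cent incidence in controls, 5 per cent if treated.
minimum = group_size(0.10, 0.05)

# Enroll well above the minimum, since the guesses are crude; the 50 per cent
# margin is an arbitrary illustration.
planned = ceil(minimum * 1.5)
print(minimum, planned)
```

Under these guesses the minimum is somewhat over 400 persons per group, and it rises steeply as the expected difference between the groups narrows, which is why crude predictions call for a generous margin.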

The more complete the epidemiologic knowledge of the deter- 
minants of the incidence and the course of disease, the more satis- 
factory the selection of population groups for inclusion is likely 
to be. A primary guide in the selection is the likelihood of observing
a high incidence of disease in persons representative of
the chosen population. Of no small importance, however, are ad- 
ditional factors such as the facilities available for observing such 
groups and the accessibility of individuals for detailed study. The 
latter considerations are of importance, for example, in selection 
of a geographic area suitable for field trials of measures directed 
against those infectious diseases that are predominant in countries 
with limited health resources. In spite of high frequency of the 
disease in one region, the lack of medical facilities is a considera- 
tion that may lead to preference for another where the expected 
incidence may be lower but where medical and public health re- 
sources permit precise diagnosis and adequate follow-up. 


Study Design 


An early field trial (1934 to 1937) serves as an example to illus- 
trate some pitfalls in design that have led to demands for ran- 
domization of study groups and more rigorous conduct of observa- 
tion. Kendrick and Eldering (1939) organized a trial of pertussis 
vaccine that, in spite of its theoretical drawbacks, elevated this 
prophylactic from a status of disputed and obscure significance to 
membership among the most useful of public health measures. 
The vaccine was offered on a voluntary basis to the preschool 
population of Grand Rapids, Michigan, through public immuni- 
zation clinics. All children of that city were listed beforehand by 
several sociologic strata, and for each child that entered the vac- 
cination roster a child of the same age and social stratum was se- 
lected for observation as an unvaccinated control. Table 42 shows 
attack rates among vaccinated and nonvaccinated children in 
three levels of exposure. As is evident from the difference in the 
totals of vaccinated and nonvaccinated children, the “matching” 
procedure was not exact. 
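
The percentages in Table 42 follow directly from the counts. As a quick check, the Python sketch below recomputes the attack rates; the counts are those reported in the table, while the variable and function names are illustrative.

```python
# (exposure group, vaccinated total, vaccinated affected,
#  not vaccinated total, not vaccinated affected), as in Table 42.
table_42 = [
    ("Exposed in own household", 83, 29, 160, 143),
    ("Exposed in other household", 100, 5, 61, 34),
    ("Indefinite exposure", 114, 4, 52, 10),
    ("No known exposure", 1518, 14, 2124, 161),
]


def attack_rate(affected, total):
    """Attack rate as a percentage of the persons observed."""
    return 100 * affected / total


rates = {}
for group, v_total, v_affected, u_total, u_affected in table_42:
    rates[group] = (attack_rate(v_affected, v_total),
                    attack_rate(u_affected, u_total))

# Column totals, used for the bottom row of the table.
v_all = (sum(r[1] for r in table_42), sum(r[2] for r in table_42))
u_all = (sum(r[3] for r in table_42), sum(r[4] for r in table_42))
rates["Total"] = (attack_rate(v_all[1], v_all[0]),
                  attack_rate(u_all[1], u_all[0]))

for group, (vaccinated, unvaccinated) in rates.items():
    print(f"{group:<28}{vaccinated:6.1f}{unvaccinated:8.1f}")
```

The unequal totals (1815 vaccinated against 2397 not vaccinated) are the inexact matching the text refers to, and the rates must be compared within exposure groups because the two series are distributed differently across them.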

The weaknesses in this study design are, first, that the subse- 
quent observation of disease could be influenced by the knowl- 
edge of previous treatment and, second, that volunteering for 
treatment may be an attribute directly associated with determi- 
nants of general hygiene, disease awareness, and so on, which can 
affect the inception and outcome of disease. 

TABLE 42

Incidence of whooping cough in a field trial of vaccines, Grand Rapids,
Michigan, 1934-1937 *

                                   Vaccinated                Not vaccinated
Exposure group               Total  Affected  Per cent   Total  Affected  Per cent
Exposed in own household        83      29      34.9       160     143      89.4
Exposed in other household     100       5       5.0        61      34      55.7
Indefinite exposure            114       4       3.5        52      10      19.2
No known exposure             1518      14       0.9      2124     161       7.6
Total                         1815      52       2.9      2397     348      14.5

* From Kendrick and Eldering (1939).

The extensive field trials of pertussis vaccine conducted in
England by the Medical Research Council from 1946 to 1950
(1951) were designed to eliminate these sources of uncertainty.
Inclusion in this study was based on the free choice by the parents, 
but at the time of election to enter the study the participants 
were informed that the children would be given either pertussis 
vaccine or an ineffective placebo. The vaccine and placebo were 
packaged identically, distinguished only by code numbers un- 
known to the participating centers, and were distributed in a 
random fashion. Consequently, neither the participants nor the 
professional personnel conducting the trial knew which children 
had received the vaccine and which the placebo. Identification of 
the treatment remained in code until the follow-up observations 
were concluded. After the code was disclosed the randomization 
was found to have been almost complete with respect to numbers 
of children, distribution of home and other exposures, and other 
characteristics of the children. For all exposure categories attack 
rates were about five times higher in the control group than in the 
vaccinated group.

This study is an example of a method which is sometimes 
called the “double-blind” technique. This phrase implies (1) 
that the treatment is identified when administered only by a 
code, the meaning of which is unknown to those giving and re- 
ceiving it, and (2) that assessment of outcome is also made with- 
out knowledge of the treatment given, or in fact of whether any 
treatment has been given. It is clear that either one or both of 
these objectives may be unattainable in particular situations.
Thus, the treatment under test may be one whose administration
involves pain or considerable inconvenience to the patient and it 
may therefore be undesirable to impose similar inconvenience 
on persons who can derive no possible benefit. Even so, however, 
it may be possible to fulfill the second objective by having assess- 
ment of outcome made by persons other than those administer- 
ing the treatment and under conditions in which they cannot 
identify the group to which the persons assessed belong. Where 
“blind” assessment is not possible, reliance must be placed on the 
selection of criteria of outcome that are as objective as possible. 

Another important characteristic of the Medical Research 
Council vaccination study was the randomization of study groups. 
It is not necessary that the study groups be picked at random 
from the population. On the contrary, it is most useful to select 
groups according to determinants of expected incidence. The
randomization is equally useful when it occurs after the decision 
has been made that this particular person should and will enter 
the study. The procedure of randomization must be prearranged 
with coded ampoules or sealed envelopes containing the assigned 
treatment. Other methods, such as assigning on the basis of date 
of admissions, first letter in names, and so on, are undesirable be- 
cause the treated persons are identifiable and unconscious manipu- 
lations are possible. 
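
A prearranged, coded allocation of the kind described can be sketched as follows. This is a hypothetical illustration in Python, not the procedure used in the Medical Research Council trial: the arm names, the seed, the four-digit code format and the choice of equal numbers per arm are all assumptions.

```python
import random


def allocation_list(n_participants, seed, arms=("vaccine", "placebo")):
    """Prepare, before enrollment begins, a coded treatment list with equal
    numbers per arm in random order."""
    if n_participants % len(arms):
        raise ValueError("n_participants must be a multiple of the number of arms")
    treatments = list(arms) * (n_participants // len(arms))
    rng = random.Random(seed)   # seed held by someone outside the clinic
    rng.shuffle(treatments)
    # The key (code number -> treatment) stays sealed until follow-up ends.
    return {f"{i:04d}": t for i, t in enumerate(treatments, start=1)}


key = allocation_list(20, seed=1951)

# The clinic sees only the code numbers, taken in order of enrollment.
codes_seen_by_clinic = sorted(key)
print(codes_seen_by_clinic[:5])
```

Because the list is fixed in advance and the clinic sees only code numbers, the assignment cannot be influenced by date of admission, name, or any judgment about the individual participant.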

Like the “double-blind” technique, randomization is also not
always practical. For example, in testing the effect of addition of 
fluoride to drinking water on the frequency of dental caries the 
practical problems of administration of the treatment require 
that the treatment be either given or not given to entire popula- 
tions having single sources of drinking water, with no possibility 
existing of random allocation within those populations. Since 
many persons will necessarily be involved in such an experiment, 
it will also be necessary that the experiment be limited to two or 
at most very few populations. Validity of results will then depend 
on (1) careful selection of experimental and control populations 
so that they are as similar as possible in all conceivably relevant 
characteristics, (2) detailed comparison of experimental and 
control populations with respect to the outcome to be assessed 
(e.g., dental caries) before introduction of the treatment, and 
(3) assessment of outcome by observers who are unaware of the
population to which the assessed persons belong. These criteria
are fulfilled and well illustrated by the fluoridation experiments 
undertaken by the U. S. Public Health Service (Ast et al., 1956; 
Arnold, 1957). 


Sequential Designs 


In a classic “double-blind” experiment, the code may not be 
disclosed before observations are complete on a group whose 
size was determined at the outset of the experiment, and some- 
times on the basis of incomplete information. The investigators 
thereby expose themselves to the dilemma that even though sig- 
nificant results may have been obtained halfway through the ex- 
periment, because of superior effectiveness of the treatment under 
test, they will not have known about it. If this is the case, an un- 
due number of “control” persons will have been jeopardized by 
not receiving the treatment. This dilemma mostly pertains to 
clinical trials where patients are admitted over a long period. 
The problem of an untoward effect of the treatment also exists; 
the investigators may be faced with the risk that the treatment 
is not only ineffective but even noxious. To safeguard against 
this it is often suggested in clinical trials that the code be re- 
vealed to the physician in charge in the event of medical emer- 
gencies, because of his professional right to know all facts about 
his patient. Perhaps this clause in the design is used much less 
frequently than one would assume, since the untoward effect 
may not show up dramatically in one patient as much as by group 
comparison. For such reasons, the current tendency is to develop 
designs for clinical trials based on the principle of sequential de- 
sign. This means that the trial is interrupted as soon as statistical 
significance has been obtained in one direction or the other. 
There is no predetermined group size. This does not necessarily 
conflict with the “double-blind” principle because the decision to 
stop the trial for statistical reasons can be made by a member of 
the study group who is not participating in the clinical work but 
makes his decision on the basis of results submitted to him se- 
quentially. 

Another design, sometimes called the semi-sequential design, 
requires that the code be disclosed and a new one substituted each 
time the study of a predetermined small number of subjects is concluded.
This number may, for instance, be 20 subjects, so that the
sequential analyses are made on consecutive sets of 20 and the de- 
cision to interrupt is made when statistical significance is obtained. 


Multiple Field Trials 


Field trials are self-limiting by nature. If a well-conducted trial 
shows the beneficial effect of a treatment or prophylactic meas- 
ure, ethics demand that further trials cannot be conducted with 
untreated controls and any future demonstration of the principle 
involved in the treatment must consist of a comparison between 
the treatment established in the first field trial and suggested 
improvements over this treatment. One could speak of the phase 
1 trial where the “control” is no treatment; such a trial is justified 
when the efficacy of treatment is unknown. A phase 2 trial would 
be one where treatment in some form is known to be effective 
but where other forms of the same treatment might prove more 
effective. The Medical Research Council (1956) conducted a 
second trial of pertussis vaccine based on this principle. The trial,
which took place from 1951 to 1954, was concerned with the rela- 
tive efficacy of fourteen vaccines of different manufacture. All of 
the 19,000 participating children were vaccinated with a pertussis 
vaccine but the field trial showed significant differences in the 
attack rates among children inoculated with the different vac- 
cines. These attack rates correlated conclusively with the relative 
potencies of the individual vaccines as measured in assays in 
laboratory animals. 


Trials and Practice 


Because of the limitation on the number of factors that can be 
varied in costly field trials, the interpretation of results in terms 
of general applicability may be limited. In addition there may be 
an inability to translate a method successfully applied in a field 
trial into a prescription for practical use. An outstanding example 
of the latter was consequent on the field trial of gamma-globulin 
for the prevention of poliomyelitis conducted by Hammon et al. 
(1952). This well-planned trial demonstrated that injection of 
human gamma-globulin led to a significantly lower incidence of 
paralytic poliomyelitis than that experienced by placebo-injected 
controls if the injections were close to the time of known exposure to
infection. The interest of the trial was primarily to test the hypothesis
that specific antibodies are important determinants in
the inception of paralytic poliomyelitis. As a result of that finding 
hundreds of thousands of children in the United States were in- 
jected with gamma-globulin during the 1953 poliomyelitis season, 
but without any demonstration of resulting decrease in incidence 
of the disease. Presumably the lack of effect was due to an inabil- 
ity to apply generally the rigid criteria that were used in the trial. 

In a field trial of poliomyelitis vaccine in 1954 (Francis et al., 
1957), school children aged 5 to 9 years were selected solely 
because expected incidence was highest in that age group and be- 
cause of the facility of enlisting school children in an epidemio- 
logic study. The successful results of the trial led to immediate 
use of the vaccine, but it is quite conceivable that the focus on 
school children in the trial led to more extensive use of the vac- 
cine in this age group than in the preschool child. At least, the 
situation five years later, in 1959, showed that the children of 
school age were immunized in a significantly higher proportion 
than the preschool children. This fact was reflected in an attack 
rate almost three times higher in the preschool children than in 
the younger school children in 1959. 


“NATURAL” EXPERIMENTS 


From time to time, circumstances arise without human intent 
that have many of the features of experiment. Although lacking 
the human intervention that makes an experiment an experiment, 
these circumstances are frequently referred to as “natural exper- 
iments.” 

An example of a “natural” experiment reported by Hyge 
(1947) concerns an outbreak of pulmonary tuberculosis in a day 
school for girls. The epidemic took place 2 months after the whole 
student body of 368 had been examined by tuberculin test and 
chest X-ray. About a third of the students had a negative tubercu- 
lin test, another third had a positive test from natural sources, 
and the remainder were positive after BCG vaccination con- 
ducted 8 months previously. The outbreak was indisputably 
traced to a science teacher with a pulmonary lesion that was active
intermittently. This teacher taught eight different grades
comprising 212 students of all ages between 10 and 18 years. Four
other classes totaling 120 students used the same classroom in 
which she had taught, but after she had conducted classes. 
Finally, there were three classes (36 girls) who had none of 
these exposures to the teacher. Table 43, showing the cases of pul- 
monary tuberculosis that developed during the next five years, 
illustrates a design that could scarcely have been improved if 368 
guinea pigs had been distributed in three equal groups by vac- 
cination status and each with three levels of intensity of exposure. 
The effect of the BCG vaccination is striking in these data. 

However, it must be borne in mind that the study would 
hardly have been published but for the unusual nature of the 
circumstances that comprised this event. Outstanding among 
these is the extreme unlikelihood that a tuberculosis incidence 
of this magnitude would occur such a short time after a whole 
population had been tested for presence of infection. The very 
rarity of such events stamps them as unusual and brings into 
doubt the validity of generalizations based on the experience. 

Evidence derived from “natural” experiments rarely is con- 
vincing, and for evidence to be as convincing as that given in this 
example is extremely uncommon. The major concern is, of course, 
the lack of randomization in the selection of the groups. It is ex- 
tremely difficult to rule out the possibility of serious bias in such 
selection. 

However, in comparison with field trials, the observation of 
such events is inexpensive, and eliminates the necessity of con- 
sidering ethical aspects inherent in experimentation. It would be 
against good scientific judgment to deny the usefulness of such 
observations. 


CAUTION 


Perhaps the greatest advance in medicine and public health in 
the last decade has been the acceptance of the role of scientifically 
designed trials in the testing of hypotheses. In the light of the 
many practical considerations, none of these trials may ever 
reach theoretical perfection, but it is no exaggeration to say that 
the acceptance of the principles involved indicates a revolution in 
medical thinking. 

However, care is required to insure that the respect due the epi- 
demiologic experiment does not swell to the extent that obser- 
vational methods become ignored or disfavored. At times, even 
now, the results of sound observation are denied because experi- 
mental “proof” is lacking. We have pointed out previously that 
there are many situations where experiment is impossible and 
many more where it is impractical. Such situations are abundant 
in the areas with which epidemiology is now particularly con- 
cerning itself — the chronic diseases — and it is likely that, as has 
happened so frequently in the past, many effective control meas- 
ures will be instituted without experimental confirmation of hy- 
potheses based on observation. 